Under consideration for publication in Theory and Practice of Logic Programming 



1 



Multi-valued Action Languages 
with Constraints in CLP(FD) * 

AGOSTINO DOVIER 

University di Udine, 
Dipartimento di Matematica e Informatica 
(e-mail: dovier@dimi . uniud . it) 

ANDREA FORMISANO 

University di Perugia, 
Dipartimento di Matematica e Informatica 
{e-mail: f ormis@dmi . unipg . it) 

ENRICO PONTELLI 

New Mexico State University, 
Department of Computer Science 
{e-mail: epontell@cs . nmsu . edu) 

submitted 5 January 2009; revised 9 October 2009; accepted 10 December 2009 



Abstract 

Action description languages, such as A and B (Gelfond and Lifschitz 1998 1, are expressive instru- 
ments introduced for formalizing planning domains and planning problem instances. The paper starts 
by proposing a methodology to encode an action language (with conditional effects and static causal 
laws), a slight variation of B, using Constraint Logic Programming over Finite Domains. The ap- 
proach is then generalized to raise the use of constraints to the level of the action language itself. 
A prototype implementation has been developed, and the preliminary results are presented and dis- 
cussed. 

To appear in Theory and Practice of Logic Programming (TPLP). 

KEYWORDS: Action Description Languages, Knowledge Representation, Planning, Constraint Log- 
ic Programming 



1 Introduction 

The construction of intelligent agents that can be effective in real-world environments has 
been a goal of researchers from the very first days of Artificial Intelligence. It has long 
been recognized that an intelligent agent must be able to acquire, represent, and reason 
with knowledge. As such, a reasoning component has been an inseparable part of most 
agent architectures in the literature. 



* This manuscript is an extended version of the paper "Multi-valued Action Languages with Constraints 
in CLP(FD)" in the Proceedings of the International Conference on Logic Programming, pages 255-270, 
Springer Verlag, 2007. 
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Although the underlying representations and implementations may vary between agents, 
the reasoning component of an agent is often responsible for making decisions that are 
critical to its existence. 

Logic programming languages offer many properties that make them very suitable as 
knowledge representation languages. Their declarative nature supports the modular devel- 
opment of provably correct reasoning modules dBaral 20031) . Recursive definitions can be 
easily expressed and reasoned upon. Control knowledge and heuristic information can be 
declaratively and incrementally introduced in the reasoning process. Furthermore, many 
logic programming languages offer a natural support for non-monotonic reasoning, which 
is considered essential for common-sense reasoning ( Lifschi tz" 19991 l. These features, along 



with the presence of efficient inference engines (Apt 2003 Marriott and Stuckey 1998 



Simons 20001 Giunchiglia et al. 2004b Geb ser et al. 20071 . make logic programming an 



attractive paradigm for knowledge representation and reasoning. 

In the context of knowledge representation and reasoning, a very important applica- 
tion of logic programming has been in the domain of reasoning about actions and change 
and, more specifically, planning. Planning problems have been effectively encoded using 
Answer Set Programming (ASP) (Baral 2003) — where distinct answer sets represent dif- 
ferent trajectories leading to the desired goal. Other logic programming paradigms, e.g., 
Constraint Logic Programming over Finite Domains (CLP(FD)) (Jaffar and Mah er 19941 
Apt 2003) , have been used less frequently to handle problems in reasoning about ac- 



tions (e.g., ( Rei ter 200 1 1 [Thielscher 2002a)). Comparably more emphasis has been placed 
in encoding planning problems as (non-logic programming) constraint satisfaction prob- 



lems ( Lopez and Bacchus 2003 ) 



Recent proposals on representing and reasoning about actions and change have relied on 
the use of concise and high-level languages, commonly referred to as action description 
languages; some well-known examples include the languages A and B dGelfond and Lifschitz 19981 ) 
and extensions like K, dEiter et al. 20041 ) and ADC (Ba ral et al. 20021) . Action languages al- 
low one to write propositions that describe the effects of actions on states, and to create 
queries to infer properties of the underlying transition system. An action domain descrip- 
tion is a specification of a planning domain using an action language. 

The goal of this work is to explore the relevance of constraint solving and constraint logic 
programming (Marr iott and Stuckey 1998j|Apt 2003] ) in dealing with action languages and 
planning. The push towards this exploratory study came from recent investigations ( Do vier et al. 2 005 1 
IDovier et al. 200 9a) aimed at comparing the practicality and efficiency of answer set pro- 
gramming versus constraint logic programming in solving various combinatorial and op- 
timization problems. The study indicated that CLP offers a valid alternative, especially 
in terms of efficiency, to ASP when dealing with planning problems. Furthermore, CLP 
offers the flexibility of programmer-developed search strategies and the ability to handle 
numerical constraints. 

The first step, in this paper, is to illustrate a scheme that directly processes an action 
description specification, in a language similar to B dGelfond and Lifs chitz 1998 1, pro- 
ducing a CLP(FD) program that can be used to compute solutions to the planning prob- 
lem. Our encoding has some similarities to the one presented by Lopez and Bacchus 



(Lopez and Bacchus 2003 ), although we rely on constraint logic programming instead of 
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plain constraint satisfaction (CSP), and our action language supports static causal laws 
and non-determinism — while the work of Lopez and Bacchus is restricted to STRIPS-like 
specifications. 

While the first step relies on using constraints to compute solutions to a planning prob- 
lem, the second step brings the expressive power of constraints to the level of the action 
language, by allowing multi-valued fluents and constraint-producing actions to be used in 
the domain specification. The extended action language (named B MV ) can be as easily 
supported by the CLP(FD) framework, and it allows a declarative encoding of problems 
involving actions with resources, delayed effects, and maintenance goals. These ideas have 
been developed in a prototype, and some preliminary experiments are reported. 

We believe that the use of CLP(FD) can greatly facilitate the transition of declarative ex- 
tensions of action languages to concrete and effective implementations, overcoming some 
inherent limitations (e.g., efficiency and limited handling of numbers) of other logic-based 
systems (e.g., ASP). 

The presentation is organized as follows. The first part of our paper (Sections [2] and [3) 
provides an overview of the action language B and illustrates our approach to modeling 
problem specifications in B using constraints and constraint logic programming. Section|4] 
provides motivations for the proposed multi-valued extensions. Section [5] introduces the 
full syntax of the new language B MV . The action language B MV expands the previous 
language to a language with constraints and multi-valued fluents, that enables the use of 
dynamic and static causal laws (a.k.a. state constraints), executability conditions, and non- 
Markovian forms of reasoning with arbitrary relative or absolute references to past and 
future points in time. The semantics and the abstract implementation of B MV is incre- 
mentally developed in Section [6j where we first consider a sub-language not involving 
non-Markovian references, and later we extend it to the full B MV . A concrete implemen- 
tation in CLP(FD) is described in Section [7] and an experimental evaluation is discussed 
in Section [H] Section [9] presents an overview of related efforts appeared in the literature, 
while Section[lO]presents conclusions and the directions for future investigation. 



2 The Action Language B 

"Action languages are formal models of parts of the natural language that are used for 
talking about the effects of actions" (Gelf ondand Lifschitz 1998b . Action languages are 
used to define action descriptions that embed knowledge to formalize planning problems. 
In this section, we use the same variant of the language B used in dSon et al. 2 001 ) — see 
also Section [9] for a comparison. With a slight abuse of notation, we simply refer to this 
language as B. 



2.1 Syntax of B 



An action signature consists of a set T of fluent names, a set A of action names, and 
a set V of values for fluents in T . In this section, we consider Boolean fluents, hence 
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V = {0, A fluent literal is either a fluent / or its negation neg(f). Fluents and actions 
are concretely represented by ground atomic formulae p(t\, . . . ,t m ) from an underlying 
logic language C. For simplicity, we assume that the set of terms is finite — e.g., either there 
are no function symbols in C, or the use of functions symbols is restricted, for instance by 
imposing a fixed maximal depth on the nesting of terms, to avoid the creation of arbitrary 
complex terms. 

The language B allows us to specify an (action) domain description V. The core com- 
ponents of a domain description are its fluents — properties used to describe the state of 
the world, that may dynamically change in response to execution of actions — and ac- 
tions — denoting how an agent can affect the state of the world. Fluents and actions are 
introduced by assertions of the forms fluent (/) and action (a) . An action descrip- 
tion T> relates actions, states, and fluents using axioms of the following types — where 
[ list-of -conditions] denotes a list of fluent literals!! 

• causes (a, £, [ list-of-conditions ]): this axiom encodes a dynamic causal 
law, describing the effect (i.e., truth assignment to the fluent literal I) of the execution 
of action a in a state satisfying the given conditions 

• caused ( [list-of-conditions ] , I): this axiom describes a static causal law — 
i.e., the fact that the fluent literal £ is true in any state satisfying the given precondi- 
tions. 

Moreover, preconditions can be imposed on the executability of actions by means of asser- 
tion of the forms: 

• executable (a, [list-of-conditions] ): this axiom asserts that, for the ac- 
tion a to be executable, the given conditions have to be satisfied in the current state. 

A domain description is a set of static causal laws, dynamic laws, and executability 
conditions. A specific planning problem (V, O) contains a domain description T> along 
with a set O of observations describing the initial state and the desired goal: 

• initially (t) asserts that the fluent literal £ is true in the initial state 

• goal (£) asserts that the goal requires the fluent literal £ to be true in the final state. 

In the specification of an action theory, we can take advantage of a Prolog-like syntax to 
express in a more succinct manner the laws of the theory. For instance, to assert that in the 
initial state all fluents are true, we can simply write the following rule: 

initially(F) :- fluent(F). 

instead of writing a fact initially (/) for each possible fluent /. Remember that the 
notation H : — B\ , . . . , B}. is a syntactic sugar for the logical formula 

where X%, . . . , X n are all the variables present in H, B\, . . . , B).. 

1 For simplicity, we use to denote false and 1 to denote true. Consequently, we often say that a fluent is true 
(resp., false) if its value is 1 (resp., 0). 

2 We will sometimes write true as a synonymous for the empty list of conditions. 
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%% Some Type Information 

barrel ( 5 ) . 

barrel (7) . 

barrel (12) . 

liter (0) . 

liter (1) . 



liter (12) . 

%% Identification of the fluents 

fluent (cont (B, L) ): - barrel (B) , liter(L), L < B. 
%% Identification of the actions 

action (fill (X, Y) ): - barrel (X) , barrel (Y) , X ^ Y. 
%% Dynamic causal laws 

causes (fill (X, Y) , cont(X,0), [cont(X.LX), cont (Y, LY) ] ) :- 

action (fill (X, Y) ) , fluent (cont (X, LX) ) , 

fluent (cont (Y, LY) ) , Y-LY > LX . 
causes (fill (X, Y) , cont (Y, LYnew) , [cont(X.LX), cont (Y, LY) ] ) 

action (fill (X, Y) ) , fluent (cont (X, LX) ) , 

fluent (cont (Y, LY) ) , Y-LY > LX, LYnew is LX+LY . 
causes (fill (X, Y) , cont (X, LXnew) , [cont(X,LX), cont (Y, LY) ] ) 

action (fill (X, Y) ) , fluent (cont (X, LX) ) , 

fluent (cont (Y, LY) ) , LX > Y-LY, LXnew is LX-Y+LY . 
causes (fill (X, Y) , cont(Y,Y), [cont(X,LX), cont (Y, LY) ] ) :- 

action (fill (X, Y) ) , fluent (cont (X, LX) ) , 

fluent (cont (Y, LY) ) , LX > Y-LY. 

%% Executability conditions 

executable (fill (X, Y) , [cont(X,LX), cont (Y, LY) ] ) :- 
action (fill (X, Y) ) , fluent (cont (X, LX) ) , 
fluent (cont (Y, LY) ) , LX > 0, LY < Y. 

%% Static causal laws caused ( [cont (X, LX) ] , neg (cont (X, LY) ) ) 
fluent (cont (X, LX) ) , fluent (cont (X, LY) ) , 
barrel (X), liter (LX), liter (LY), LX^LY. 

%% Description of the initial and goal state 

initially (cont (12, 12) ) . 

initially (cont (7,0)) . 

initially (cont (5,0)) . 

goal (cont (12, 6) ) . 

goal (cont (7,6)). 

goal (cont (5,0)). 

Fig. 1. B description of the 12-7-5 barrels problem. 



Example 1 

Figure [T] presents an encoding of the three-barrel planning problem using the language B. 
There are three barrels of capacity N (an even number), iV/2+1, and N/2—1, respectively. 
At the beginning, the largest barrel is full of wine while the other two are empty. We wish 
to reach a state in which the two larger barrels contain the same amount of wine. The only 
permissible action is to pour wine from one barrel to another, until the latter is full or the 
former is empty. Figure [T] shows the encoding of the problem for N = 12. Notice that we 
also require that the smallest barrel is empty at the end. □ 



2.2 Semantics of B 



If / G T is a fluent, and S is a set of fluent literals, we say that S \= f if and only if / e S 
and S \= neq(f) if and only if neg(/) E S. A list of literals L — [£%, . . . , £ m ] denotes 
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a conjunction of literals, hence S \= L if and only if S \= li for all i £ {1, . . . , m}. 
We denote with the set {/ 6 T : neg(/) £ S} U {neg(/) : f £ S D T}. A 
set of fluent literals is consistent if there is no fluent / s.t. S \= f and S \= neg(/). 
If S U -iS D then S is complete. A set 5 of literals is closed w.r.t. a set of static 
laws SC = {caused(Ci,£i), . . . , caused(C m , £ m )}, if for all i £ {1, . . . , m} it holds 
that S \= Ci implies S \= li. The set Closc{S) is defined as the smallest set of literals 
containing S and closed w.r.t. SC. Closc(S) is uniquely determined and not necessarily 
consistent. 

The semantics of an action language on the action signature (V, J 7 , A) is given in terms 
of a transition system (S,v,R) (Gelf ond and Lifschitz 1998b , consisting of a set S of 
states, a total interpretation function v : T x S — > V (in this section V = {0, 1}), and 
a transition relation R C S x A x S. 

Given a transition system (S, u, R) and a state s £ S, let: 

Lti(s) - {f £ T : f(f, s) = 1} U {neg(/) : f £ T , v(f, s) = 0}. 

Observe that Lit(s) is consistent and complete. 

Given a set of dynamic laws {causes(a, l\, C\), . . ., causes(a,i m , C m )} for the ac- 
tion «ei and a state s £ S, we define the (direct) effects of a in s as follows: 

E(a, s) = {U : 1 < i ^ m, Lit(s) |= CJ. 

The action a is said to be executable in a state s if it holds that 

h 

Lit(s) h V/ C - W 

i=l 

where executable(a, C\), . . ., executable(a, Ch) for h > 0, are the executability ax- 
ioms for the action a in P. Observe that multiple executability axioms for the same action 
a are considered disjunctively. Hence, for each action a, at least one executable axiom must 
be present in the action description!! 

Let V be an action description defined on the action signature (V, J-, A), composed of 
dynamic laws T>£, executability conditions ££, and static causal laws SC 

The transition system (S, v, R) described by V is a transition system such that: 

• S is the set of all states s such that Lit(s) is closed w.r.t. SC; 

• R is the set of all triples (s, a, s') such that a is executable in s and 

Lit(s') = Clo sc {E{a 1 s)U(Lit(s)nLit{s'))) (2) 

Let (V, O) be a planning problem instance, where {£ | initially(£) £ O} is a consis- 
tent and complete set of fluent literals. A trajectory in (S, v, R) is a sequence 

(so, 01,31,02, ■ ■ ' , ON, Sn) 

such that (si,a i+ i,Si + i) £ R for alH € {0, . . . , N — 1}. 

3 Observe that even if an action is "executable", its execution may lead to an inconsistent state (which effec- 
tively prevents the use of such action in that context). Even though "enabled" would be a better term to use 
for an action that can be executed in a state, we prefer to maintain the same terminology as used for B in 
<Son et al. 200T ) — see also Remark[2] 



Multi-valued Action Languages in CLP(FD) 



7 



A sequence of actions (oi, . . . , a\) is a solution (a plan) to the planning problem (T), O) 
if there is a trajectory (so, Oi, Si, . . . , on, sn) in (S, u, R) such that: 

• Lit(so) |= r for each initially(r) G O, and 

• Lit(sw) (= £ for each goal(^) G C 

The plans characterized in this definition are sequential — i.e., we disallow concurrent ac- 
tions. Observe also that the desired plan length N is assumed to be given. 

Remark 1 

In this paper we focus on sequential plans only. Hence, we assume that only one action is 
executed in each state transition composing a given trajectory. 

Note that the constraint-based encoding we will propose in the rest of this manuscript 
can be easily adapted to deal with concurrent actions. Nevertheless, we have opted to ignore 
this aspect in this manuscript, to avoid further complications of notation, and dealing with 
issues of concurrency goes beyond the scope of this paper. The interested reader is referred 
to dDovier et al. 2009bt for some further considerations on this matter. 

Remark 2 

Notice that the satisfaction of (TJ is just a necessary requirement for the executability of an 
action and it might not represent a sufficient precondition. Indeed, as far as the definition 
of transition system is considered, it is easy to see that, even if (Q]l is satisfied for certain a 
and s, the execution of a in s might be inhibited because of the contradictory effects of the 
causal laws. A simple example is represented by the following action description T>: 

executable (a, []) . 
causes (a, f , [ ] ) . 
causes (a, neg (/) , [ ] ) . 

The action a is always executable (according to its executability law), but the execution of 
a would yield an inconsistent situation. Indeed, the execution of a does not correspond to 
any state transition in the transition system described by V. 

The above example also suggests a possible extension of the action description language 
that involves laws of the form 

nonexecutable (a, D) . 

The semantics for such an extended action language can be defined by replacing the con- 
dition ([T), with the following one: 

h k 

Lit(s) hV^A-V^ 
i=i j=i 

where executable (a, Ci ),..., executable (a, Ch) and nonexecutable (a, D\) , 
. . ., nonexecutable (a, Dk ) , for h > and k 0, are defined for the action a. Thus, the 
action a is executable only if at least one of the CjS is satisfied and all DjS are unsatisfied 
in the state s. 

An alternative interpretation of the nonexecutable axioms can be adopted. Namely, 
the law nonexecutable (a, D) can be considered simply as shorthand for the pair of 
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FromState ToState 




Fig. 2. Action constraints from state to state. (The states are described by p fluents, 
£i, . . . , £ p , and one among m possible actions is executed.) 

dynamic causal laws causes (a, /, D) and causes (a, neg(/), D) . (Actually, this 
possibility also applies to the languages proposed in (Gel fond and Lifschitz 19 98 )). 

This shows that (non)executability laws do not increase the expressive power of the 
action language. Nevertheless, the availability of both types of laws permits the direct and 
explicit formalization of preconditions for actions execution. 

3 Modeling B and Planning Problems Using Constraints 

Let us describe how action descriptions are mapped to finite domain constraints. We will 
focus on how constraints can be used to model the possible transitions from each individual 
state of the transition system. 

3.1 Modeling an Action Theory as Constraints 

Let us consider a domain description T> and the state transition system described by V. 
Let us also denote with u and v the starting and ending states of an arbitrary transition of 
such a system. We assert constraints that relate the truth value of fluents in u and v. This is 
intuitively illustrated in Figured where u = FromState and v = ToState@ 

A Boolean variable is introduced to describe the truth value of each fluent literal in a 
state. The value of a fluent literal I in u is represented by the variable FT; analogously, 
its value in the destination state v is represented by the variable F«. For the sake of sim- 
plicity, we will freely refer to these variables as Boolean entities — and compose them with 
logical connectives to form Boolean expressions — as well as 0/1 variables — and compose 
them with arithmetic operators. Concrete CLP(FD) systems, e.g., SICStus, ECLiPSe, and 
BPrologH enable this type of alternative perspectives, providing basic primitive constraints 
(e.g., #= and #>) and Boolean compositions of constraints. 

Given a conjunction of literals a — [li , . . . , £ m ] we will denote with a u the expression 
Ff A . . . AF" . We will also introduce, for each action a, a Boolean variable ^4", represent- 
ing whether the action is executed or not in the transition from u to v under consideration. 

4 For the sake of readability, the two variables named FromState and ToState are also used in the concrete 
implementation of B (cf., Section [33l and Figure|6j. 

5 Web sites for some CLP(FD) systems. SICStus: www.sics.se/sicstus.html 
ECLiPSe: [http: 7/87 .230 .22 .228/1 BProlog; [http : / /www . probp ■ com71 



Multi-valued Action Languages in CLP(FD) 



9 



Dyn £ 


m r 


(3) 


Stat? 


<-> V T« 
j'=i 


(4) 


Fired £ 


<-> Dyn^ V Stat/ 


(5) 




-iFired £ V-iFiredj 


(6) 




<-► Fired?'* V (-.Fired]?'" A F?) 


(7) 



Fig. 3. The constraint C"' v for the fluent literal £ (cf., Section IXTT i. 

Given a specific fluent literal £, we develop constraints that determine when FJ is true 
and false. Let us consider the dynamic causal laws that have £ as a consequence: 

causes(fli (1 ,f,Qf ! i) ••■ causes(a^_ m ,i, a^.mj 

Let us also consider the static causal laws related to £ 

caused(7^i, £) ■■■ caused(7^ j / t{ , £) 

Finally, for each action a we will have its executability conditions: 

executable(a, (5 a> i) ••• executable(a, <5 a ,p a ) 

Figure [3] describes the Boolean constraints that can be used in encoding the relations that 
determine the truth value of the fluent literal £. In the table, we denote with £ the comple- 
ment of literal £, i.e., if £ is the fluent /, then £ is neg(/), while if £ is the literal neg(/) 
then £ is the fluent /. The intuitive meaning of the constraints is as follows: 

(fj) This constraint states that dynamic causal laws making £ true can fire if their conditions 
are satisfied and the corresponding actions are chosen for execution. 

(|4) This constraint captures the fact that at least one of the static causal laws that make / 
true is applicable. 

(H) This constraint expresses the fact that a fluent literal £ can be made true during a 
transition form state u to state v, either by a dynamic causal law (determined by Dyn") 
or a static causal law (determined by Stat]?). 

© This constraint is used to guarantee consistency of the action theory — in no situations 
a fluent and its complement are both made true. 

© This constraint expresses the fact that a fluent literal I is true in the destination state 
if and only if it is made true (by a static or a dynamic causal law) or if is true in the 
initial state and its truth value is not modified by the transition (i.e., inertia). Observe the 
similarity between this constraint and the successor state axiom commonly encountered 
in situation calculus (Levesque et al. 1998). 

We will denote with C"'" the conjunction of such constraints. 

Given an action domain specification over the signature (V, A) and two states u and 
v, we introduce the system of constraints Cp v which includes: 

• for each fluent literal £ in the language of T, the constraints C"'". 
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• the constraint 



• for each action a € A, the constraints 



K -> V <r 

Notice that the sequentiality of the plan if imposed through the constraint ©, while con- 
straint (O reflects actions' executability conditions. 



3.2 Soundness and Completeness Results 

Let us proceed with the soundness and completeness proofs of the constraint-based en- 
coding. Consider a state transition from the state u to the state v and the corresponding 
constraint Cfi' v described earlier. 

Let S — Lit(u) and S 1 = Lit(v) be the sets of fluent literals that hold in u and v, respec- 
tively. Note that, from any specific S (resp., S'), we can obtain a consistent assignment as 
(resp., as') of truth values for all the variables FJ (resp., FJ) of u (resp., v). Conversely, 
each truth assignment as (resp., 175/) for all variables (resp., FJ) corresponds to a 
consistent and complete set of fluents S (resp., S'). 

Regarding the occurrence of actions, recall that in each state transition a single action a 
occurs and its occurrence is encoded by a specific Boolean variable, A". Let a a denote the 
assignment of truth values for such variables such that a a (A^) = 1 if and only if a occurs 
in the state transition from u to u[^Note that the domains of as, as>, and a a are disjoint, 
so we can safely denote with as ° as> ° a a the composition of the three assignments. With 
a slight abuse of notation, in what follows we will denote with E the direct effects E(a, u) 
of an action a in u. Observe that E C S 1 . 

Theorem[T]states the completeness of the system of constrains introduced in Section lXTl 
It asserts that for any given V = (T>£, ££, SC), if a triple (u, a, v) belongs to the transition 
system described by V, then the assignment a = as °as> °a a satisfies the constraint C^ v . 

Theorem 1 {Completeness) 

Let T> = (T>£, ££, SC). If (u, a, v) belongs to the transition system described by T>, then 
as o as> o a a is a solution of the constraint Cp v . 

Proof 

In constraints ©-(IT) of Figure[3]and dHJ-© defined at the end of Subsection l3.ll a number 
of auxiliary constraint variables are defined, whose values are uniquely determined once 
the values of the fluents are assessed. In other words, when S, S', and a are fixed, the 
substitution 0-5 o 0-5/ o a a uniquely determines the value of the right-hand sides of the 
constraints |3j-((5j. To prove the theorem, we need to verify that if S' = Clogc(E U(Sn 
S')), then the constraints (|6]) and ^} along with the constraints about the action variables 
A™ (i.e., constraints of the form (O and (0) are satisfied for every fluent /. 



6 We will use mapping applications either as u(X) or in postfix notation as Xa. 
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Let us observe that ^ is equivalent to say that if A a is true ( A a = 1) then Ab is false for 
all b a. Moreover, it also states that if all Ab for b ^ a are false then A a is true. Namely, 
(0 is equivalent to the conjunction, for a G A of: 

A a <-> f\ ->A 6 
beA\{b} 

Let us start by looking at the action occurrence. Let a be the action executed in state u, 
thus g a = {Al/l} U {Al/Q | b ^ a}. Hence, © is satisfied by cr a . 

Similarly, since the semantics require that actions are executed only if the executability 
conditions are satisfied, it holds that S \= 8 a .h (for at least one h G {1, . . . ,p a }, corre- 
sponding to a condition executable(a, S a ,h) in SC). This quickly leads to Vj=i $a,j i s 
true, and this allows us to conclude that © is satisfied by as ° cr a - 

Let us now consider the constraints dealing with fluents. We recall that S' is a set of 
fluent literals that is consistent, complete, and closed w.r.t. SC. Let us consider a fluent / 
and let us prove that constraint © of Figure[3]is satisfied. Assume, by contradiction, that 
Firedf'"cr and Fired"'" , t ,a are both true. Four cases must be considered: 

/ neg(/) 

1. Dynja and Dyn" eg ^cr are true. Since these values are determined by u, a, v, this means 
that both / and neg(/) belong to E(a, u). Since the closure under SC is monotonic this 
means that Lit(v) — S' is inconsistent, representing a contradiction. 

2. Dynja and StatJJ/ «cr are true. This means that / is in E(a,u) and neg(/) is added 
to S' by the closure operation. This implies that S' is inconsistent, which represents a 
contradiction. 

3. Static and Dyn" eg( .^cr are true. This leads a contradiction as in the previous case. 

4. Statjer and Stat^^cr are true. This means that / and neg(/) are added to S' by the 
closure operation. Thus, S' is inconsistent, which is a contradiction. 

It remains to prove that constraint (fTJ is satisfied by a. Let us assume that / E S'. Thus, 
FJas' is true. Three cases must be considered. 

1. / G E(a,u). This means that there is a dynamic causal law causes(a, /, where 
S \= ctf_i. From the definition, this leads to aj i cr being true and a a (A 1 ^) = 1. Thus, 
constraints |3]) and (j5J) set Dynja and Fired^a both true. As a consequence, constraint 
(J2J) is satisfied. 

2. / ^ E(a, u) and / G S. This means that / G S H S'. In this case Fired^^cr must be 
false, otherwise S' would be inconsistent (by closure). Thus, FJas should be true, F^as' 
is true and Fired"'" , «<t is false, which satisfy constraint (fTJ (regardless of the value of 
Firedy a). 

3. / ^ E(a, u) and / ^ S. This means that / is inserted in S' by closure. Thus, there is a 
static causal law of the form caused(7/j, /) such that S' |= jf.j. In this case, by (j4|, 
Stated is true and, by (]5]), so is Fired^'"er. Thus, constraint (|7j is satisfied. 

If / ^ 5", then neg(f) G <S" and the proof is similar with positive and negative roles 
interchanged. □ 



Let us observe that the converse of the above theorem does not necessarily hold. The 
problem arises from the fact that the implicit minimality in the closure operation is not 



12 



A. Dovier, A. Formisano, E. Pontelli 



reflected in the computation of solutions to the constraint. Consider the domain description 
where T = {f,g,h} and A = {a}, with the following laws: 

executable (a, [ ] ) . caused ( [g] , h) . 

causes (a, /,[]). caused ( [h] , g) . 

Let us consider S = {neg (/) , neg (g) , neg (h) }. Then, S' = {f,g,h} determines a 
solution of the constraint C l p v with the execution of action a, but Closc(E U (S n S')) = 
{/} C S'. However, the following holds: 

Theorem 2 {Weak Soundness) 

Let T) — (T>C, EC, SC). Let 0-5 o as> cr a identify a solution of the constraint Cp" . Then 

Clo sc (E(a,u)U(Sr\S')) CS'. 

Proof 

It is immediate to see that erg and erg/ uniquely determines two consistent and complete 
sets of fluent literals u and v. Let / be a positive fluent in Closc(E(a, it) U (S O S')). We 
show now that / 6 S'. 

1 . If / is in S (~1 S' we are done. 

2. If f £ E(a, u), there is a law causes(a, /, a/,i) such that S (= a/^. Since S 1 is determined 
by erg, by ([3]), we have that 0-5 o er a is a solution of i A A", which implies that Dyn^ is 
true, and as'(Ff) is tru e m &S'- Therefore, / S S". Observe also that cr making true 
will imply that <5" h is true (for some ft, G {1, . . . ,f> a })> which will imply satisfiability of 
the executability preconditions for a. 

3. We are left with the case of / S(a, u) and f £ S H S' . Since S" is determined by 0-5/, 
and / G Clo 5£ (i?(a, u) U (5 (~l 5')), there is a law caused(7/j, /) such that S' \= 7/j, 
and by construction erg/ makes 7? . true. Thus, statj is true and therefore is true. 
Hence, / G S' . 

The proof proceeds similarly in the case of a negative fluent neg(f) in Clos£ (E(a, u) U 
(5nS'))- □ 

Let us consider the set of static causal laws SC. We can introduce a notion of positive 
dependence graph, following the traditional principle of dependence analysis used in logic 
programming (e.g., dLin and Zhao 2004)). The graph Q(SC) is defined as follows: 

• the set of the nodes in Q(SC) corresponds to the set of fluent literals, i.e., 

Nodes{Q{SC)) = {/ I /€f}U {neg(/) | / G T) 

• edges are created to denote the dependence of a fluent literal on other literals due to 
a static causal law, i.e., 

Edges{G(SC)) = {(li,£ 2 ) I caused(L, £1) G SC, L= [ . . .,£2, . . .]} 

A set of fluent literals L is a loop if, for any £%, £2 G L, we have that there is a path from £\ 
to £2 in G{SC) such that all nodes encountered in such path are in L. We say that a domain 
specification V — (VC, EC, SC) is acyclic if the graph Q(SC) does not contain any loops. 
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Fig. 4. Sets of fluents involved in a state transition and a literal I introduced by closure. 

Theorem 3 (Acyclic Soundness) 

Let V — (T>C,£C,SC). Let as ° us' ° <?a be a solution of the constraint C^'". If the 
dependency graph of P is acyclic, then Closc(E(a, u) U (5 fl 5')) = S". 

Theorem|2]proves that Clos£(i?(a, u) U (5 fl 5')) C S". It remains to prove that for any 
(positive or negative) fluent I, if I e S", then £ S Clo 5£ (£:(a, tt)U(Sn S")). 

If i € £(a, u) or I G 5, then trivially I G Clo sc (E{a, ti)U(Sn 5")). 
Let us prove that (cf., Figure |4)l: 

(£ e S' A l i E(a, u)u(Sn S')) Clo 5£ (E(a, n)u(5n S')) 

To this aim, consider the dependence graph Q(SC). Because of the acyclicity of Q(SC), 
there are nodes in G(SC) without incoming edges — we will refer to them as leaves. For 
any node I of Q(SC), let d(£) denote the length of the longest path from a leaf of Q(SC) 
to I. We prove the property for a positive fluent literal I — f, by induction on d(£). 
Base case. If / ^ E(a, u) U (5* fl 5") is a positive fluent which is a leaf (the proof is similar 
for the case of negative literals), then two cases could be possible. 

• There is no law of the form caused(_, /) in SC. In this case, it cannot be that / € S' 
due to constraint (0). 

• There is a law caused([], /). In this case / G S' by closure. 

Inductive step. Let / ^ E(a, u) U (S fl S') be a positive fluent such that there are laws 
caused(7/ i i, /),..., caused(7/ i / l , /) in SC. By the inductive hypothesis, let us assume 
that the thesis holds for each fluent literal I such that d(£) < d(f). Since / ^ E(a, u) and 
/ ^ S fl 5', we have that FJ is false, .FJ is true, and Dyn^ is false under as ° o\s' ° cr a - 
From the fact that constraint ((7]) is satisfied, it follows that Stat ^ is true. Moreover, Dyn^ 
is false because / ^ E(a, u). On the other hand, because of |6]), we have that D Y n n e g(/)' 
Stat^ eg( -jj, and Fired"^^ are all false. Consequently, constraint ((7]) can be rewritten as 

FJ «-» VjLi l} j- Since f £ S' (i.e., is true), there must exists a j G {1, . . . , h} such 
that 7* j is verified by 0-5/ . This implies that, for each fluent g required to be true (resp., 
false) in Jfj, F£ is set true (resp., false) by as>- By inductive hypothesis, such fluent 
literals (either g or neg(g)) belong to Clo sc (E(a, u)U(Sn S')). Since Clo S£ (i?(a, u) U 
(S n S')) is closed w.r.t. the static laws, it follows that / G Clo S c(E(a, u)U(Sfl S')). 
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The proof in case of a negative fluent neg(f) is similar. 



□ 



In order to achieve soundness in cases where the graph Q(SC) contains loops, it is nec- 
essary to introduce additional constraints in conjunction with C^ v . Intuitively, in the se- 
mantics of B, cyclic dependencies created by the static causal laws are resolved by the 
closure operation Clos£(-) by minimizing the number of fluent literals that are made 
true — this derives by the implicit minimality of the closure. Additional constraints can 
be added to enforce this behavior; these constraints can be derived by following a prin- 
ciple similar to that of loop formulae commonly used in the context of logic program- 
ming ( Lin and Zhao 2 004). 

The notion of loop formulae can be developed in our context as follows. Let L = 
{£i, . ..,4}be a loop in G(SC) and let us consider the transition from u to v as stud- 
ied earlier. Let us define a counter-support for £j w.r.t. the loop L as a set of constraints cs 
with the following properties: 

• for each causes(aj, £j, a) in DC, cs contains either A™. = or Fjf — 1 for some 
£ in a; 

• for each caused(7, £i) in SC such that none of £±, . . . , £k is in 7, for some £ in 7 
cs contains Ft = 1; 

• cs contains either Ff = 1 or F? = 1. 

(As usual, we might identify a set cs of constraint with their conjunction, depending on the 
need.) Let us denote with Counters(£i, L) u ' v the set of all such counter-supports. The loop 
formulae for L w.r.t. u, v is the set of constraints 



Form(L) u ' v = {c x A • • • A c k F£ = A • • • A F/ fc = | 6 Counters^, L) u ' v }. 
To take into account all different loops in Q(SC), let Form(J)) u - v be the constraint 



Following the analogous proofs relating answer sets and models of a program comple- 
tion that satisfies loop formulae (e.g., dLin and Zhao 20 04)) one can show: 

Theorem 4 (Soundness) 

Let T> = (TJL, ££, SC) and let as ° s 1 & a be a solution of the constraint C^' v A 
Form(V) u > v . Thus, C±o sc {E{a, u) U (S D S')) = S'. 

Let the action description V meet the conditions of Theorem|4]and let (S, v, R) be its 
underlying transition system. The following can be proved. 

Theorem 5 

There is a trajectory (sq, ai, s±, 02, ... , on, sn) in the transition system (S, v, R) if and 
only if sq is closed w.r.t. SC and there is a solution for the constraint 






L is a loop in Q (SC) 



N-l 



A i c : 
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Proof 

The result follows directly by application of Theorems Q] and 0] and by observing that for 
each transition (sj , a J+ i , Sj+i), the satisfaction of constraint implies that the state 

Sj + i is closed w.r.t. SC. □ 

Let (V, O) be an instance of a planning problem where T> is an action description and 
O contains any number of axioms of the form initially(C) and goal(C). We can state 
the following. 

Corollary 1 

There is a trajectory (s , a\, s 1; a 2 , ■ ■ ■ , a-^, for the planning problem (T>, O) if and 
only if so is closed w.r.t. the static causal laws of T> and there is a solution for the constraint 

N-l 

f\ C s ° A f\ (C S J' S > +1 AForm(V) s ^ s ^) A f\ C SN 

initially(C)eCI j=0 goal(C)eO 



3.3 Mapping the Model to CLP(FD) 

The modeling described in Section [XT] has been translated into a concrete implementa- 
tion using SICStus Prolog. In this translation, constrained CLP variables directly reflect 
the Boolean variables modeling fluents and action's occurrences. Consequently, causal 
laws and executability conditions are directly translated into CLP constraints (and in- 
herit the corresponding completeness and soundness results). In this section we high- 
light the main aspects of the implementation — while the complete code can be found 
at |www . dimi . uniud . it /dovier/CLPASPl 

A plan with exactly N + 1 states, p fluents, and m actions is represented by: 

• A list, called States, containing N + 1 lists, each composed of p terms of the form 
fluent (f luent_name, Bool.var) . The variable of the i th term in the j th list is 
assigned 1 if and only if the i th fluent is true in the j th state of the trajectory. For 
example, if we have N = 2 and the fluents f , g, and h, we have: 

States = [ [fluent (f,X_f_0) , fluent (g,X_g_0) , fluent (h,X_h_0) ] , 
[fluent (f , X_f_l) , fluent (g, X_g_l) , fluent (h, X_h_l) ] , 
[fluent (f,X_f_2) , fluent (g,X_g_2) , fluent (h,X_h_2) ] ] 

• A list ActionsOcc, containing N lists, each composed of m terms of the form 
action (action_name, Bool.var) . The variable of the i th term of the j th list is 
assigned 1 if and only if the i th action occurs during the transition from state j to 
state j + 1. For example, if we have N = 2 and the actions are a and b, then: 

ActionsOcc = [ [ act ion (a, X_a_l ) , act ion (b, X_b_l ) ] , 
[ action (a, X_a_2 ) , action (b, X_b_2 ) ] ] 

The planner makes use of these structures in the construction of the plan; appropriate con- 
straints are set between the various Boolean variables to capture their relationships. For 
each list in ActionsOcc, exactly one action (a^, va^ ) contains a variable that is assigned 
the value 1 (cf., constraint ©). 
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(i) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

(8) 

(9) 

(10) 

(11) 

(12) 

(13) 



clpplan {N, ActionsOcc, States) :- 
setof(F, fluent (F) , Lf ) , 
setof (A, action (A) , La) , 
make.states (N, Lf, States), 
make_action_occurrences (N, La, ActionsOcc), 
setof (F, initially (F) , Init), 
setof (F, goal (F) , Goal), 
set-initial ( Init , States), 
set.goal (Goal, States), 
set.transitions (ActionsOcc, States) , 
set.executability (ActionsOcc, States) , 
get_all_actions (ActionsOcc, AllActions) , 
labeling (AllActions } . 



Fig. 5. Main predicate of the CLP(FD) planner. 

The CLP implementation of the B language assumes that the action description is en- 
coded as Prolog facts — observe that the syntax of B is compliant with Prolog's syntax, al- 
lowing us to directly store the domain description as rules and facts in the Prolog database. 
The entry point of the planner is shown in Figure [5] 

The main predicate is clpplan (N, ActionsOcc, States) (line (1)) that computes 
a plan of length N for the action description present in the Prolog database. If such a plan 
exists, the variables in ActionsOcc and states will be instantiated so as to describe the 
found trajectory. 

Lines (2) and (3) collect the lists of all fluents (Lf ) and all actions (La). Lines (4) and (5) 
are used for the creation of the lists States and ActionsOcc. In particular, all the vari- 
ables for fluents and actions are declared as Boolean variables. Furthermore, a constraint 
is added to enforce that in every state transition, exactly one action can be fired. 

Lines (6) and (7) collect the description of the initial state (init) and the required con- 
tent of the final state (Goal). These information are then added to the Boolean variables 
related to the first and last state, respectively, by the predicates in lines (8) and (9). 

Lines (10) and (11) impose the constraints on state transitions and action executability, 
as described in Section |3~TI We will give more details on this part below. 

Line (12) gathers all variables denoting action occurrences, in preparation for the label- 
ing phase (line (13)). Note that the labeling is focused on the selection of the action to 
be executed at each time step. Some details on the labeling strategy are discussed in Sec- 
tion|8] Please observe that in the code of Figure[5]we omit the parts concerning delivering 
the results to the user. 

The main constraints are added by the predicate set_transitions. The process is 
based on a recursion across fluents and consecutive states. The predicate set_one_f luent 
is called (see Figure [6]i at the core of the recursion. Its parameters are the fluent F, the 
starting state FromState, the next state ToState, the list Occ of action variables, and 
finally the variables iv and ev, related to the value of the fluent F in FromState and 
ToState, respectively (see also Figure^. 

For a given fluent F, the predicate set_one_f luent collects the list DynPos (respec- 
tively DynNeg) of all the pairs [Action, Preconditions] such that the dynamic action 
Action makes F true (respectively false) in the state transition (lines (15) and (16)). The 
variables involved are then constrained by the procedure dynamic (lines (17) and (18)). 

Similarly, the static causal laws are handled by collecting the lists of conditions that 
affect the truth value of a fluent F (i.e., the variables StatPos and StatNeg, in lines 
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(14) 
(15) 
(16) 
(17) 
(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 



set_one_f luent (F, IV, EV, Occ, FromState, ToState) 
findall ( [X, L] , causes (X, F, L) , DynPos) , 
findall ( [Y,M] , causes (Y, neg (F) , M) , DynNeg) , 
dynamic (DynPos, Occ, FromState, DynP, EV) , 
dynamic (DynNeg, Occ, FromState, DynN, EV) , 
findall (P, caused(P,F), StatPos) , 
findall (N, caused (N, neg (F) ) , StatNeg) , 
static (StatPos, ToState, StatP, EV) , 
static (StatNeg, ToState, StatN, EV) , 
bool.disj (DynP, StatP, PosFired) , 
bool.dis j (DynN, StatN, NegFired) , 
PosFired*NegFired #= 0, 

EV #<=> PosFired #\/ (#\ NegFired #/\ IV) . 



(27) 
(28) 
(29) 
(30) 
(31) 
(32) 
(33) 
(34) 



dynamic ( [ ] , _, _, [ ] , _) . 



dynamic ([ [Action, Precondition] |R], Occ, FromState, [Flag I Flags ] , EV) 
member (action (Action, VA) , Occ), 

get_precondition_vars (Precondition, FromState, ListPV) , 
length (ListPV, NPrec) , 
sum(ListPV, SumPrec), 

(VA #/\ (SumPrec #= NPrec) ) #<=> Flag, 
dynamic (R, Occ, FromState, Flags, EV) . 



(35) 
(36) 
(37) 
(38) 
(39) 
(40) 
(41) 



static ([],., [],-). 



static ( [CondlOthers] , ToState, [Flag I Flags ] , EV) 
get_precondition_vars (Cond, ToState, ListPV) , 
length (ListPV, NPrec), 
sum(ListPV, SumPV) , 
(SumPV #= NPrec) #<=> Flag, 
static (Others, ToState, Flags, EV) . 



Fig. 6. Transition from state to state. 

(19)-(20)) and constraining them through the procedure static (lines (21) and (22)). The 
disjunctions of all the positive and negative conditions are collected in lines (23) and (24) 
and stored in PosFired and NegFired, respectively. 

Finally, lines (25) and (26) take care of the relationships between all these variables. 
Line (25) implements the constraint (O for the state ToState of Figure[3] stating that we 
do not want inconsistent action theories. If PosFired and NegFired are both false, then 
ev = iv (inertia). Precisely, a fluent is true in the next state (EV) if and only if there is an 
action or a static causal law making it true (PosFired) or it was true in the previous state 
(iv) and no causal law makes it false. 

Let us consider the predicate dynamic (see line (27) in Figure [6]). It recursively pro- 
cesses a list of pairs [Action, Preconditions] . The variable VA associated to the execu- 
tion of action Action is retrieved in line (29). The variables associated to its preconditions 
are retrieved from state FromState and collected in ListPV in line (30). A precondition 
holds if and only if all the variables in the list ListPV are assigned value 1, i.e., when their 
sum is equal to the length, NPrec, of the list ListPV. If (and only if) the action variable 
va is true and the preconditions holds, then there is an action effect (line (33)). 

Similarly, the predicate static (line (35) in Figure |6]l recursively processes a list of 
preconditions. The variables involved in each of such precondition Cond are retrieved from 
the state ToState and collected in ListPV (line (37)). A precondition holds if and only if 
all the variables in the list ListPV have value 1, i.e., when their sum is equal to the length, 
NPrec, of ListPV. This happens if and only if there is a static action effect (see line (40)). 

Executability conditions are handled as follows. For each state transition and for each 
action Act, the predicate set_executability_sub is called (see Figure|7]). The variable 
va, encoding the application of an action Act is collected in line (44). A precondition hold 
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(42) 
(43) 
(44) 
(45) 
(46) 
(47) 
(48) 
(49) 
(50) 
(51) 
(52) 
(53) 
(54) 
(55) 



preconditions.f lags ( [C I R] , State, [Flag I Flags ] ) :- 
get_precondition_vars (C, State, Cs) , 
length (Cs, NCs), 
sum (Cs, SumCs) , 
(NCs #= SumCs) #<=> Flag, 
preconditions.!" lags (R, State, Flags). 



preconditions.f lags ( [ ] , _, []). 



set_executability_sub ( [ [Act, C] I CA] , ActionsOcc, State) :- 
member (action (Act , VA) , ActionsOcc) , 
preconditions.!" lags (C, State, Flags), 
bool.disj (Flags, F) , 
VA #==> F, 

set_executability_sub (CA, ActionsOcc, State). 



set_executability_sub ( [ ] , _, _} . 



Fig. 7. Executability conditions. 

if and only if the sum of the (Boolean) values of its fluent literals equals their number 
(lines (52)-(54)). The variable Flags stores the list of these conditions and the variable F 
their disjunction. If the action is executed (va = 1, see line (47)), then at least one of the 
executability conditions must hold. 

4 The Action Language with Constraints on Multi- valued Fluents 

As a matter of fact, constraints represent a very declarative notation to express relationships 
between unknowns. As such, the ability to use them directly in an action language greatly 
enhances the declarative and expressive power of the language, facilitating the encoding 
of complex action domains, such as those involving multi-valued fluents. Furthermore, the 
encoding of an action theory using multi-valued fluents leads to more concise and more 
efficient representations and better exposing non-determinism (that could be exploited, for 
example, by a parallel planner). Let us consider some representative examples. 

Example 2 (Maintenance Goals) 

It is not uncommon to encounter planning problems where along with the type of goals 
described earlier (known as achievement goals), there are also maintenance goals, rep- 
resenting properties that must persist throughout the trajectory. Constraints are a natural 
way of encoding maintenance properties, and can be introduced along with simple tem- 
poral operators. E.g., if the fluent fuel represents the amount of fuel available, then the 
maintenance goal which guarantees that we will not be left stranded could be encoded as: 
always (fuel > 0) . □ 

Example 3 (Control Knowledge) 

Domain-specific control knowledge can be formalized as constraints that we expect to be 
satisfied by all the trajectories. For example, we may know that if a certain action occurs 
at a given time step (e.g., ingest.poison) then at the next time step we will always perform 
the same action (e.g., call -doctor). This could be encoded as 



where occ (a) is a fluent describing the occurrence of the action a and f 1 indicates that the 
fluent / should hold at the next time step. □ 



caused ( [ occ (ingest .poison) ] , occ (call .doctor) 1 ) 
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Example 4 (Delayed Effect) 

Let us assume that the action request .reimbursement has a delayed effect (e.g., the in- 
crease by $50 of bankjiccount after 30 time units). This could be expressed as a dynamic 
causal law: 

causes (request jreimbur sement , incr (bankjiccount , 50 ) 30 , [ ] ) 

where incr is a constraint introduced to deal with additive computations — in a way closer 
to B's syntax we should write: 

causes {request jreimbur sement , bankjiccount 30 = bankjiccount + 50, [ ] ) . 

This is a particular case of additive fluents dLee and Lifschitz"2~0 03). □ 

In what follows we introduce the action description language B MV in which multi- 
valued fluents are admitted and constraints are first-class components in the description of 
planning problems. The availability of multi-valued constraints enables a number of im- 
mediate language extensions and improves the expressive power of the overall framework. 

Action description languages such as B rely on the common assumption, tradition- 
ally referred to as Markovian property in the context of systems and control theory: the 
executability of an action and its effects depend exclusively on the current state of the 
world (Gabaldon 2002; McCarthy 1998). Nevertheless, it is not uncommon to encounter 



real world situations where such property is not satisfied, i.e., situations where the exe- 
cutability and/or the effects of an action depend not only on what holds in the current 
situation, but also on whether some conditions were satisfied at a previous point in time. 
For example, an agent controlling access to a database should forbid access if in the recent 
past three failed password submission attempts have been performed by the user. 

Although non-Markovian preconditions and effects can be expressed in a Markovian 
theory through the introduction of additional fluents (and a correct handling of inertia), the 
resulting theory can become significantly larger and less intuitive. An alternative solution 
consists of admitting past references in modeling such kind of situations. In this frame of 
mind, B AIV allows timed references to past points in time within constraints, i.e., non- 
Markovian expressions that might involve fluents' values. Effects of dynamic laws that 
involves future references might also be specified. As a further feature the B MV language 
admits the specification of global constraints (involving absolutely specified points in time) 
and costs for actions and plans. 

The resulting description language supports all the kind of modeling and reasoning out- 
lined in the above Examples |2]-|4] 

In the next sections, we first introduce the syntax of the full-blown action description 
language £> My (Section[5]i. In Section|6]we will develop the semantics and the constraint- 
based abstract implementation of this new language. In doing this, for the sake of read- 
ability, we proceed incrementally in order to focus on the main points and features of the 
framework. We first consider the sub-language Bq IV obtained from B MV by disallowing 
timed references (Section fe.U : in Section [6721 we treat the general case dealing with past 
and future references. The abstract implementation is provided in Section 1631 Finally, we 
give the semantics to the complete language involving cost and global constraints (Sec- 
tion Ij 
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5 The Language B MV 

As for £>, the action signature consists of a set T of fluent names, a set A of action names, 
and a set V of values for fluents in T. In the following we assume that VCZ. 
In an action domain description, an assertion (domain declaration) of the type 

fluent (f,{d\,...,d k }) 

declares that / is a fluent and that its set of values is {d%, . . . , dk}', we refer to the set 
{di, . . . , dk} as the domain of /. We also admit the simplified notation f luent(/, di, (fe) 
to specify all the integer values in the interval [d\, d%] as admissible (with d\ < d-£). 

An annotated fluent (af) is an expression /*, where / is a fluent and ( 6 Z. We will 
often denote /° simply by /. Intuitively speaking, if t < then /* denotes the value that 
the fluent / had t steps ago in the past; similarly, if t > 0, then /* denotes the value / will 
have t steps in the future. We refer to annotated fluents with t > as positively annotated 
fluents. 

Annotated fluents can be used in Fluent Expressions (fe), which are defined inductively 
as follows: 

FE ::= d I AF FEi © FE 2 | - (fe) | abs(FE) I rei(FC) 

where d G V and © € {+, — , *, /, mod}, fc is a fluent constraint (see below). We refer 
to the fluent expressions rei (FC) as the reification of the fluent constraint FC — its formal 
semantics is given in Section loTTl 

Fluent expressions can be used to build primitive fluent constraints (PC), i.e., formulae 
of the form FEi op FE2, where FEi and FE2 are fluent expressions, and op is a relational 
operator, i.e., op G {=, 7^, >, <, >, <}. Fluent constraints are propositional combinations 
of primitive fluent constraints: 

PC ::= FEi op fe 2 
C PC I -iC I Ci A C 2 | Ci V C 2 

The constant symbols true and false can be used as a shorthand for true constraints 
(e.g., d = d, for some d G V) and unsatisfiable constraints (e.g., d 7^ d). 

The language £> MV allows one to specify an action domain description, which relates 
actions, states, and fluents using axioms of the following forms (PC denotes a primitive 
fluent constraint, while C is a fluent constraint). 

• Axioms of the form executable(a, C), stating that the fluent constraint C has to 
be satisfied by the current state for the action a to be executable. 

• Axioms of the form causes(a, PC, C) encode dynamic causal laws. When the ac- 
tion a is executed, if the constraint C is satisfied by the current state, then state 
produced by the execution of the action is required to satisfy the primitive fluent 
constraint PC. 

• Axioms of the form caused(Ci, C2) describe static causal laws. If the fluent con- 
straint C\ is satisfied in a state, then the constraint C2 must also hold in such state. 

An action domain description of B MV is a tuple (T>£, ££, SC), where £ C is a set of 
executability conditions, SC is a set of static causal laws, and DC is a set of dynamic 
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causal laws. In the following, we assume that positively annotated fluents can occur only 
in the effect part of dynamic causal laws. 

A specific instance of a planning problem is a pair (T>, O), where T> is an action do- 
main description and O contains any number of axioms of the form initially(C) and 
goal(C), where C is a fluent constraint. 

Example 5 

A sample action theory in B MV is: 

fluent (/, {1,2,3,4,5}) . 
fluent (<?, {1,2,3,4,5}) . 
fluent (h, {1,2,3,4,5}) . 
causes (a, f = g + 2, g < 3) . 
executable (a, true) . 
initially (/ = 1) . 
initially (g = 1) . 
initially (h = 1) . 
goal (/ = 5) . 

□ 

Notice that, for any given dynamic law causes(a, PC, C), such that a is executed in 
a state u satisfying C, the constraint PC has to be evaluates/satisfied in the target state v. 
Hence, the (relative) timed references occurring in PC (respectively, in C) are resolved 
with respect to v (resp., u). On the other hand, for a static law caused(Ci, C2), relative 
timed references of both C\ and C2 have to be resolved with respect to the current state. 

5.7 Absolute Temporal References 

The language B MV allows the definition of absolute temporal constraints, i.e., constraints 
that refer to specific moments in time in the trajectory (by associating the time point to 
the initial state), differently from the case of annotated fluents, where points in time are 
relative to the current state. A timed fluent is defined as an expression of the form 

FLUENT @ TIME. 

Timed fluents can be used to build timed fluent expressions (TE) and timed primitive con- 
straints (TC), similarly to what done for normal fluents. For instance, the constraint 

/@2 < 5 @4 

states that the value the fluent / has at time 2 in the plan is less than the value that the 
fluent g has at time 4. Similarly, ft@2 = 3 imposes that the fluent h must assume value 3 at 
time 2. 

Timed constraints can be used in the following kind of assertion: 

time_const raint(TC) 

The assertion requires the timed constraint TC to hold. 
Some other accepted constraints are: 
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• holds(FC, n): this constraint is a particular case of the previous one. It is satisfied 
if the primitive fluent constraint FC holds in the n th state. It is therefore a gen- 
eralization of the initially axiom. Observe that assertions of this kind can be 
used to guide the search of a plan by adding some point-wise information about the 
states occurring along the computed trajectory (e.g., this is useful to implement the 
landmarks model as used in the FF planner (Hoffman n et al. 20041 1 ). 

• always(FC): this constraint imposes the condition that the fluent constraint FC 
holds in all the states. Observe that FC has to be evaluated in all states, and its 
evaluation is strict — i.e., any reference to fluents outside the time limits leads to the 
satisfaction of the constraint; hence, annotated fluents should be avoided in FC. 

In specifying a planning problem (T>, O), we can consider such kinds of assertions as part 
of the observations O. 

Example 6 

Let us consider the case of an agent that has a certain amount of money (e.g., $5, 000) 
to invest; she is interested in purchasing as many stocks as possible. The stocks can be 
purchased from three trading agencies (1, 2, and 3); each agency has 1, 000 stocks available 
at $2 each. The stocks have to be purchased in separate transactions, but each trading 
agency require the agent to have a balance of at least $2, 000 at the start of the day before 
agreeing in the transaction. A purchase can be of at most 3, 000 shares at a time. 
We can model this problem with the following fluents: 

fluent (money, 0,5000). f luent(have(stockl), 0, 1000). 

fluent (have(stock2), 0, 1000). f luent(have(stock3), 0, 1000). 

fluent (available(stockl),0, 1000). fluent (available(stock2),0, 1000). 
fluent (available(stock3),0, 1000). 

f luent (price(stockl) , 2, 2). fluent (price(stock2) , 2, 2). 
fluent (price(stock3) , 2, 2)). 

The only action is 

actlon(buy(StockType,N)) : -N > 0,N < 3000. 

The executability condition for the action captures one property: the agent is accepted by 
the trading agency. 

executable(buy(Type,N),money@0 > 2000 A money > N * price(Type)). 

The dynamic causal law for this action is: 

causes {buy [Type, N), money = money — N * price(Type), true). 
causes(buy(Type, N), have{Type) = have(Type) + N, true). 

The initial state can be described as 

initially(price(stocfcl) = 2). initially(price(stocfc2) = 2). 

initially(price(sfocfc3) = 2). initially(/iawe(stocfel) = 0). 

±n±t±ally(have(stock2) — 0). initially(/iaue(stocfc3) = 0). 

initially(monei/ = 5000). ±n±t±ally(available(stockl) — 1000). 

±n±t±ally(available(stock2) = 1000). initially(auaiZaMe(stocfc3) = 1000). 

□ 
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5.2 Cost Constraints 

In B MV it is possible to specify information about the cost of each action and about the 
global cost of a plan (that is defined as the sum of the costs of all its actions). This type of 
information are useful to explore the use of constraints in determining optimal plans. 

The cost of actions is expressed using assertions of the following forms (where FE is a 
fluent expression built using the fluents present in the state): 

• action_cost(a, FE) specifies the cost of the execution of the action a as result of 
the expression FE. 

• state_cost(F£') specifies the cost of a state as the result of the evaluation of FE. 

Whenever, for an action or a state, no cost declaration is provided, a default cost of 1 is 
assumed. Once we have provided the costs for actions and states, we can impose constraints 
on the cumulative costs of specific states or complete trajectories. This can be done in B MV 
using assertions of the following types (where k is a number and op a relational operator): 

• cost_constraint(plan op k); the assertion adds a constraint on the global cost 
of the plan. 

• cost_constraint(goal op fc); the assertion imposes a constraint on the global 
cost of the final state. 

• cost_constraint(state (i) op k); the assertion imposes a constraint on the 
global cost of the i th state of the trajectory. 

As an immediate generalization of the above constraints, we admit assertions of the form 
cost_constraint(C), where C is a constraint, possibly involving fluents, where the 
atoms plan, goal, and state (i) might occur in any place where a fluent might — 
intuitively representing the cost of a plan, of the goal state, and of the ± th state, respec- 
tively. 

Some directives can be added to an action theory to select optimal solutions with respect 
to the specified costs: 

minimize_cost(i 7 'i?), 

where FE is an expression involving the atoms plan, goal, and state ( i ) , and possibly 
other fluents. This assertion constrains the search to determine a plan that minimizes the 
value of the expression FE. For instance, the two assertions minimi ze_cost (plan) and 
minimize_cost(goal) constrain the search of a plan with minimal global cost and with 
minimal cost of the goal state, respectively. 

We provide a more precise semantics for all these assertions in Section |6~4l In specifying 
a planning problem (T>, O), we consider cost constraints as part of the observations O. 



6 Semantics and Abstract Implementation of B v 

We will build the semantics of the language B MV incrementally. We will start by building 
the semantics for the sub-language of B MV devoid of any form of time reference and cost 
constraints (Section loTl i. This core language is called Bq IV . The subsequent Sections [6.21 - 
Eltreat the full B MV . 
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6.1 Semantics for Timeless Constraints 

Each fluent / is uniquely assigned to a domain dom(/) in the following way: 

• if fluent (/, Set) G V, then dom(/) = Set. 

A function v : T -> V U {_L} is a state if v(f) G dom(/) U {_L} for all / G T. The 
special symbol _L denotes that the value of the fluent is undefined. A state v is complete if 
for all / G T, v(f) ^ _L. For a number N ^ 1, we define a state sequence v as a tuple 
(vo, ■ ■ ■ , «n) where each is a state. 

Given a state u, and an expression <p, we define the value of y> in v (with abuse of 
notation, denoted by v(ip)) as follows^ 

• v(x) = x if a; G V 

• v (f) = v (f) if / G -T 7 (abuse of notation here) 

• = -( u (^)) 

u(abs(^)) = |(^))| (10) 

• © <y5 2 ) = V(tfil) © l)(( y 9 2 ) 

• w(rei(C)) = 1 if V \= C 

• w(rei(C)) = if v Y= C 

We treat the interpretation of the various © operations and relations as strict with respect 
to L (i.e., ±®x = x®± = ±, abs(_L) = _L, etc.). 

The last two cases in (fl~0T > specify the semantics of reification. Reified constraints are 
useful to enable reasoning about the satisfaction state of other formulae. The intuitive 
semantics is that a fluent expression rei(C), where C is a fluent constraint, assumes a 
Boolean value (0 or 1) depending on the truth of C. Note that the semantics of reified con- 
strains relies on the notion of satisfaction, which in turn is defined by structural induction 
on constrains, as follows. Given a primitive fluent constraint <p\ op (f2, a state v satisfies 
ipi op cf2, written v \= ipi op tp 2 , if and only if it holds that v(ipi) op v(<fi2) where the se- 
mantics of the arithmetic relators/operators is the usual one on Z. If either v(ip%) or v(ip2) 
is _L, we assume that v Y= ipi op ip 2 (and v ^ (p% nop tp 2 where nop is the negation of 
the operator op). Basically undefined formulas are neither proved nor disproved. The sat- 
isfaction relation |= can be generalized to the case of propositional combinations of fluent 
constraints in the usual manner. 

Given a constraint C, let fluents(C) be the set of fluents occurring in it. A function 
<7 : fluents(C) — ► V is a solution of C if a (= C. We denote the domain fluents(C) of 
the function a as dom(cr). In other words, a solution a of C can be seen as a partial state 
satisfying C. Observe that we require the solution to manipulate exclusively the fluents 
that appear in the constraint. 

Example 7 

Let us consider an action theory over the fluents /, g, h, where each fluent has domain 
{1, . . . , 5}. If C is the constraint / > g + 2, then a solution of C is a = {//5, g/2}. 
Note that the substitution = {f/5,g/2, h/1} is not a solution of C, since dom(#) ^ 
fluents(/ >g + 2). □ 

7 The expression \n\ denotes the (algebraic) absolute value of n. 
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Let a be a solution of a constraint C and v a state, with ine(cr, v) we denote the state 
obtained completing a in v by inertia, as follows: 



ine(o-,u)(/) = 



a(f) if / € dom(er) 
v(f) otherwise 



Example 8 

Let us continue with Example [7] If a = {f/5,g/2} and v = {//l, g/1, h/1}, then 
\ne(a,v) = {f/5,g/2,h/l}. □ 

An action a is executable in a state w if there is an axiom executable(a, C) such that 
v h C 1 - 
Remark 3 

As for the case of the language £>, also in B MV the executability laws express necessary 
but not sufficient preconditions for action execution (cf., Remark|2]i. Moreover, thanks to 
the generality of the constraint language — i.e., any propositional combination of primitive 
constraints can be used in B MV — the executable laws also allow direct formulation 
of non-executability conditions and the roles of the executable and nonexecutable 
axioms coincide. 

Let us denote with Dyn(a) the set of dynamic causal law axioms for action a. The effect 
of executing a in state v, denoted by Eff(a, v), is a constraint defined as follows: 

Eff{a,v) = f\{C causes(a,C,Ci) G Dyn{a),v |= d}. 



6.1.1 Bq without static causal laws 

Let us start by considering the simplified situation in which SC = 0, i.e., no static causal 
laws are specified in the domain description. 

During the execution of an action a, a fluent has to be considered as inertial, provided 
that it does not appear among the effects of the dynamic laws for a. In other words, since 
these effects are expressed through constraints, a fluent is inertial if it does not occur in any 
of the constraints specified in the dynamic laws for a. 

The description of the state transition system corresponding to a given action description 
theory {"DC, ££, 0} can be completed by defining the notion of transition. 

A triplet (v, a, v'), where v, v' are complete states and a is an action, is a valid state 
transition if: 

• the action a is executable in v, and 

• v' = ine(<7, v), where a is a solution of the constraint Eff(a, v). 

Let (D, O) be an instance of a planning problem, v = (vq, ■ ■ ■ , i>n) be a sequence of 
complete states and a\, . . . , on be actions. We say that (vq, a,x,V\, . . . , on, "n) is a valid 
trajectory if: 

• for each axiom of the form initially(C) in O, we have that vo \= C, 

• for each axiom of the form goal(C) in O, we have that un |= C, and 

• for all i e {0, . . . , N — 1}, (vi, aj+i, Vj+i) is a valid state transition. 



26 



A. Dovier, A. Formisano, E. Pontelli 



Example 9 

Let us consider the Example [5] Observe that ({//l, g/1, h/1}, a, {f/5,g/3, h/1}) is a 
valid trajectory. □ 

Remark 4 

Given a planning problem (V, O) in Bq IV , differently from what happens in the case of B, 
a solution to a planning problem is described by a valid trajectory, not just by a sequence of 
actions. This is the case because actions might have non-deterministic effects. For instance, 
let us consider Example[5] If the action a is executed and the precondition g<3 holds, then 
the dynamic causal law imposes the constraint f =g+2 in the reached state. There are many 
different ways to satisfy this requirement. Hence, in general, a sequence of actions might 
not characterize a unique state sequence. 

The same argument also applies to the action description language B MV , so in what 
follows we will consider the valid trajectories as the solutions of a planning problem. 

6.7.2 Abstract implementation in absence of static laws 

In this section we propose a constraint-based characterization of the state transition system 
defined in Section l6.1.1l Similarly to what we have done in the case of B, for any specific 
state, each fluent / will be represented by an integer-valued constraint variable. Boolean 
variables will instead model the occurrences of actions. 

Let u be a state; given a fluent /, we indicate with Ff the variable representing / in u. 
We generalize such a notation to any constraint C, i.e., we denote with C u the constraint 
obtained from C by replacing each fluent / £ fluents(C) by Ff. For each action a € A, 
a Boolean variable A" is introduced, representing whether the action is executed or not in 
the transition from u to the next state. 

Given a specific fluent /, we develop a system of constraints to constrain the values 
of Ff. Let us consider the dynamic causal laws that have / within their consequences: 

VCf = {causes(a, / ,i,C ? / ) i,Q!/ ) i), • • • , causes(a,- / , Cf >mf , af, mf )} 

For each action a we will have its executability conditions: 

£C a = {executable(a, 5 a ,i) : ■•• , executable(a, S a , Pa )} 

Figure [8] describes the constraints C u ^' v a that can be used in encoding the relations that 
determine the value of the fluent / in the state v (i.e., constrain the variable FJ) w.r.t. 
the application of the action a in the state u. After the settings of the domains (by (fTTT)). 
we impose through ( fT2b that if action a is executed, then at least one of the preconditions 
for its executability must hold in u. For each j £ {1, . . . , m/} the constraint ( TT3b defines 
a Boolean flag Dyrif ,■ that holds if and only if action ai s j is applicable in u and the 
preconditions of the j th dynamic causal law for / holds in u. The constraint ( TPfl ) requires 
that if Dyrif j is true, then the corresponding effects must hold in the new state v. Finally, 
inertia constraints are set by means of tl5i . 

We will denote with C^' v the conjunction of these constraints for all actions a £ A. 
Given an action domain specification over the signature (V, T , A) and two states u, v, the 
system of constraints C^ v includes: 
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Fig. 8. The constraints 0*'% for a state transition from u to v, for a fluent /. 

• the constraint Cj' v for each fluent literal / in the language of T 

• the constraint J^aeA — 1 (unique action execution in the state transition). 

The next theorem states completeness and soundness of the encoding described so far. 
We need a further piece of notation. Given two states u, v and an action a, let C^p a be the 
constraint obtained from Cp v by setting A a = 1, Af, = for all b ^ a, and FJ — u(f) 
for each fluent literal /. 

Theorem 6 

Let V = (T>C,£C,$) and let two states and a an action. Then (u,a,v) is a valid 
transition in the semantics of the language Bq IV if and only if v represents a solution of 
the constraint C^ a . 

Proof 

(=>) Let (u,a,v) be a valid transition. Then, a is executable in u. Hence u \= 5 a j for 
some j 6 {1, . . .,p a } and (PL?! is satisfied. By the definition of state we have that (fTTT i 
is also satisfied. Let v = ine(<7, it) with a solution of Eff(a, u). 

If / is a fluent not belonging to dom(cr) then / does not occur in Eff(a. u) and it is 
not affected by any dynamic causal law involved in the state transition. By definition 
of ine(-) we have that v(f) = u(f) and this satisfies constraint H5) . Satisfaction of 
constraints (l3[ and ([Pil l is immediately verified by observing that for all dynamic causal 
laws causes^ h , Cf^, ctf,h) having / in Cf t h, the constraint o.f,h is false m u - Then, 
the corresponding flag Dyn^j h is set false by ( TOI l. Consequently, (Tl4l) is satisfied. 
Assume now that / is a fluent in dom(cr). This means that there are dynamic causal 
laws causes (aj, h , C/^, atf,h) such that a/,^ is true in u, for h e X = {jx, . . . , j r } C 
{1, . . . ,rrif}. Consequently, the flag Dyn^ h is set true for /i£l and false otherwise. 
Since a is a solution of Eff(a,u), v satisfies the constraint Cj • for all j 6 X. This 
implies that ( fT4l is satisfied for each j € {1, . . . , m/}. Since some flags Dyn'j . are true 
constraint (TT~5T > is satisfied too. 
(<=) Assume that v satisfies the constraint C^ a . By (TL?! . because A a = 1, some of the 
constraints <J" h is satisfied. Hence, action a is executable in u. By the satisfaction of (fT~3b 
and (fl4] |. u satisfies all constraints CJ j for which the corresponding aj j is satisfied. 
Then, v is a solution for Eff(a,u). Consequently, since v = ine(?j,u) (by definition, 
since v is complete), (u, a, v) is a valid transition. 
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□ 

Let (T>, O) be an instance of a planning problem where T> is an action description and O 
contains any number of axioms of the form initially(C) and goal(C). We can state 
the following. 

Theorem 7 

There is a valid trajectory (vq, di, «i, cli, • • • > a N, «n) if and only if there is a solution for 
the constraint 

N-l 

/\ C v ° A y/y (C%' Vi+1 ) A f\ C VN 

initially(C)eO j=0 goal(C)GO 

Proof 

The result follows from (repeated) applications of Theorem[6] □ 

6.1.3 Adding static causal laws 

In this section we consider the case of action theories (T>£, £C, S£) of Bq IV , involving 
static causal laws (i.e., such that S£ ^ 0). 

The presence of static laws requires refining the semantics of the language, in order to 
ensure proper treatment of inertia in the construction of a valid trajectory. 

We start by defining three operations n, U, and A on states, as follows: 

ViUv 2 (f) 

v 1 nv 2 {f) 
A( Vl ,v 2 ,S)(f) 

where the set S used in A is a set of fluents. Observe that ine(cr, v) = A (a, v, dom(cr)). 
A state v is closed w.r.t. a set of static causal laws 

SC — {caused(Ci, Di), . . . , caused(C/c, D^)} 

if v \= (Ci — y Dx) A • • • A (Ck — > D k ). We denote this property as v \= SC 

Given two states v, v', a set of fluents D, and a set SC of static causal laws, we say that 
v' is minimally closed w.r.t. v, D, and SC if 

• v' \= SC (i.e., v' is closed) and 

• for all S C D, if A(v, v', S) ^ v' then A(v, v', S) \f= SC. 

The notion of minimally closed state is intended to capture the law of inertia, w.r.t. a given 
set D of fluents. Notice, in fact, that A(v, v', 0) = v'. Intuitively speaking, v' is minimally 
closed when it is obtainable from v by applying a minimal set of (necessary) changes in the 
values of the 'inertial' fluents (those in D). In other words, it is not possible to obtain from 
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Fig. 9. The set A(v, v', X) is obtained by combining a portion of v and a portion of i>', 
depending on the third argument X, which acts as a regulator in "mixing" portions of v 
and v'. The figure visualizes, in gray, the two sets A(v,v',D) (above) and A(v,v',S) 
(below) for S C D C T and illustrates the definition of minimal closure. A state v' is 
minimally closed if and only if v' \= SC and for all S C D, if A(w, v' , D) ^ u' then 
A(v, v' , 5*) ^= 5>C. In both cases, the surrounding frame represents the set T of all fluents. 



v a state different from v' and closed w.r.t. SC, by applying "fewer changes" than those 
involved in obtaining v' . A pictorial representation of A(v, v' , X) is shown in Figure[9] 
Observe that if SC = then v' is minimally closed w.r.t. v, D, and SC if and only if 

v = v'. 

Example 10 

Let /, g, h be fluents with dom(/) = dom(g) — dom(h) = {0, 1} and 

SC = {caused(/ = l-,g— 1), caused(/ = 0,g = 0)}. 
Consider the states v = {f/0,g/0,h/0},v' = {f/l,g/l,h/l},v" = {f/0,g/0,h/l} 
and let D — {/, g}. Then, v' and v" are both closed w.r.t. SC. 

However, v" is minimally closed w.r.t. v, D, and SC, while v' is not minimally closed 
since A(v, v' , D) — {h/1, f/0, g/0} is different from v' and closed. □ 

A triplet (v, a, v'), where v and v' are complete states and a is an action, is a valid 
transition if: 

1 . the action a is executable in v and 

2. we have that v' = ine(er, v') where 

• (j is a solution of the constraint Eff(a, v), and 

• v' is minimally closed w.r.t. v, T \ dom(er), and SC. 

Intuitively, the conditions that define a transition are designed to guarantee that: 
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• a solution a for the constraints describing the effects of the action is determined; 

• such solution is part of the new state v' constructed (thanks to v' = \ne(a, v')); and 

• the new state is minimally closed with respect to all the fluents not affected by the 
execution of the action. 

Let us observe that, since all fluents in the domain of any solution a of Eff(a, v) maintain 
the same value in v', it holds that v' |= Eff(a, v). 

Notice that the notion of a valid transition given in presence of static laws properly 
extends the one given in Section l6.1.1l In fact, the following property holds: 

Lemma 1 

If SC = then ine(er, v) — \ne(a, v'). 
Proof 

It is sufficient to note that, if SC = then v 1 is minimally closed w.r.t. T \ dom(a) if and 
only if ine(cr, v) — v'. □ 

Example 11 

Let us extend the action description of Example [10] We consider the following domain 
description: 

fluent (/, {0, 1}). fluent^, {0, 1}). 

fluent (/l, {0, 1}). 

action(a). executable(a, h = 0). 

causes(a, h = 1) 

caused(/ = l,g = 1). caused(/ = 0, g = 0). 

Let us consider the three states v = {//0, g/0, h/0}, v' = {//l, g/1, h/1}, and v" = 
{//0, g/0, h/1}. Then (v, a, v") is a valid transition, while (v, a, v') is not. □ 

Let (T>, O) be a planning problem instance. Let v = (vq, ■ ■ ■ , «n) be a sequence of 
complete states and let oi, . . . , bn be actions. Then (vq, a\, Vi, . . . , d\, wn) is a valid 
trajectory if the following conditions hold: 

• vq \= SC, and for each axiom initially(C) in O, we have that vq \= C; 

• for each axiom of the form goal(C) in O, we have that un |= C; 

• (vi, Oi+i, Vi + i) is a valid transition, for each i £ {0, . . . , N — 1}. 

6.1.4 Abstract implementation in presence of static laws 

Let us consider a fluent / and a transition from state u to state w, due to an action a, and let 
us adopt the same notation (Ff, C u , A", etc.) introduced in Section l6.1.2l The state tran- 
sition from u to w can be seen as the composition of two steps involving an intermediate 
state v. The first of these steps reflects the effects of the dynamic laws, whereas the second 
step realizes the closure w.r.t. the static causal laws. Hence we proceed by introducing a 
set of variables corresponding to the intermediate state v — ine(<7, u), where a is a solution 
of Eff(a,u). The constraint-based description of the first step is essentially the same we 
described in Section l6.1.2j — thus, we only need to extend the constraint system defined in 
Figure[8]to reflect the second part of the transition. 
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Fig. 10. The constraints for a state transition from u to w (with intermediate state v), for a 
fluent /. 



Given a set L C T of fluents, let SC l C 6>£ be the collection of all static causal laws 
in which at least one fluent of L occurs. Moreover, for simplicity, let SCf denote SC^fj, 
i.e., the set of all static causal laws that involve the fluent /. 

Let us define a relation R C T x T so that /1-R/2 if and only if SC j\ D SC / 2 ^ 0. R 
is an equivalence relation and it partitions T. Each element (i.e., equivalence class) of the 
quotient T jR is said to be a cluster (w.r.t. SC). Notice that a cluster can be a singleton 
{/}. Let / be a fluent, we denote with L f its cluster w.r.t. SC. 

Example 12 

Assume that SC consists of the rules 

caused(true, / = 1). caused(g = 2, h = 3). caused(/i < 5, r = 2). 

Then the two clusters are {/} and {<?, h, r}. □ 

Given a fluent /, let us consider the sets of dynamic and executability laws T>C / and 
£C a , as defined in Section 16.1.21 Moreover, let us consider the cluster containing /, let it 
be Lf = {/i, . . . , fk}, and the corresponding set of static causal laws SCl } '- 

SCl s = {caused(G/,i, -D/,i), • • • , caused(G/ : / l/ , D f j lf ) } . 

Figure [10] describes the constraints (to be added to those in Figure [H} that are used in 
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encoding the relations that determine the value of the fluent / in state w (represented 
through the variable FJ) after the execution of action a in the state u (we recall that v is 
to be considered as an intermediate state v = ine(<r, u)). 

The constraint (fTol i sets the domains for the variables FJ . The constraint (TTTb propagates 
to w the effects of the dynamic laws. Constraint < fT~8T > imposes closure w.r.t. the static causal 
laws. Finally, constraints ([T9l>-(l2"0l> require that if all the fluents in dom(er) that belong to 
the cluster L f are left unchanged in the transition, then all the fluents of L/ should not 
change their values. More precisely, as far as ( fT9] i is concerned, Staff is set to true if, for 
all fluents g in Lf, either g is not affected by the dynamic laws (i.e., F£ — F™), or for 
each activated dynamic law causes^. j , Cfj, aif,j) (i.e., such that its precondition aj ■ 
is true), g does not occur in its effects (i.e., in Cfj). Notice that, with respect to a specific 
state transition, we are not considering subject to inertia all those fluents that occur in the 
effects of (at least) one activated dynamic law. 

The enforcement of the constraint d20l i constitutes a necessary, but not sufficient, condi- 
tion for the target state to be minimally closed. We will discuss later on this point. 

Let us denote with Cj w the conjunction of the constraints (fTTt — (fT~8T> for all actions 
a £ A Given an action domain specification over the signature (V, T , A), the system of 
constraints C l p w includes: 

• the constraint CJ W for each fluent literal / in the language of T; 

• the constraint J2 a eA = 1- 

Similarly, let Stat^ w denote the conjunction of all the constraints of the forms JT9] i and 
©. 

The next theorem states completeness of the encoding described so far. Again, given 
two states u, w and an action a, let C*p a and Stat^ a denote the constraints obtained from 
Cp W and Stater" , respectively, by setting A a = 1, A\, = for all b ^ a, and FJ = u(f) 
for each fluent literal /. 

Theorem 8 

Let V = (VC,£C, SC) and let u,w two states and a an action. Then, if (u,a,w) is a 
valid transition in the semantics of the language Bq IV , then w represents a solution of the 
constraint C^ a A Stat^ 11 . 



Proof 

For the constraints (fTTl i-(fT6]i. considering the transition from u to v, the proof proceeds 
analogously to the first part of the proof of Theorem|6] 

Let us sketch the part of the proof regarding the effect of the static causal laws. Since 
(u, a, w) is a valid transition, w — ine(<r, w), w agrees with v = ine(cr, u) on all fluents 
in dom(cr), hence ( fTTT i hold. Moreover, w is closed w.r.t. SC, hence it satisfies (fT8T i. From 
the fact that w is minimally closed w.r.t. ine(er, u),J 7 \ dom(cr), and SC, it follows that w 
satisfies QIHSB. □ 

The above encoding does not guarantee soundness. This is because the constraints (fTTIi- 
( fT~8l > in Figures [8] and [10] might admit solutions not corresponding to minimally closed 
states. 
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We introduced the notion of cluster to partially recover the soundness of the encoding. 
Intuitively speaking, a cluster generalizes, to the multi-valued case, the notion of loop seen 
in Section l3~2l a cluster is a set of fluents whose values have been declared to be mutually 
dependent through a set of static causal laws. In a state transition, similarly to the case of 
loops, changes to the fluents of a cluster might occur because of their mutual influence, not 
being (indirectly) caused by dynamic laws. 

Constraints il9[ and ( f20b impose inertia on all the fluents of a cluster whenever none 
of them is influenced by dynamic laws. However, note that imposing ([T9]>-(l20l> does not 
completely circumvent the problem because state transitions violating the inertia are still 
admitted. In fact, (fT9b — (f20b do not impose inertia on the fluents of a cluster when at least 
one of them is changed by the dynamic laws. This might lead to invalid transitions, in 
which a change in the value of a fluent of a cluster happens even if this is not necessary in 
order to satisfy all the static causal laws. 

Nevertheless, we introduced the constraints ( fT9l ) and (f20]l because they constitute a good 
compromise w.r.t. the efficiency of a concrete implementation (as discussed later). 

To completely enforce soundness, we need to apply a filter on the solutions that are 
admitted by the encoding described so far. To this aim, let us introduce a condition on the 
values of the fluent, which is intended to mimic, in the multi-valued setting, the effect of 
loop formulae. 

Let us assume that the action a is executed in the state u, and that a, v, and w have been 
determined so that to satisfy the constraint C*p w . In this situation the following constraint 
characterizes an hypothetical state x, different from w: 



Intuitively, the satisfaction of such a formula witnesses the existence of a counterexample 
for the minimal closure of w. Notice that, being a, v, and w already determined, the only 
fluents/variables to be determined are those describing the state x, if any. The conjunct in 
line (fJTJ states that a; is a target state alternative to w; in particular, it enforces the closure 
of x w.r.t. SC. The conjunction (l22l states that x and w agree on the fluents in dom(er). 
Finally, (l23l states that x must differ from w and it must agree with u in at least one 
fluent — that, because of d22l . it is in T \ dom(cr). 

We can prove the following result, that generalizes Theorem|6]to the case of SC ^ 0. 

Theorem 9 

Let V = (T>C, £C, SC) and u, w two states, with u closed w.r.t. SC. Let a an action such 
that w represents a solution of the constraint C^' a . Then (u, a, w) is a valid transition in 
the semantics of the language Bq IV , if Form(T>) u ' a is unsatisfiable. 




(21) 



(22) 




(23) 



Proof 
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By proceeding as in the proof of Theorem [6j we can show that all needed conditions for 
(u, a, w) to be a valid transition are satisfied, except for the minimal closure of w. 

Let us assume, by contradiction, that w is not minimally closed w.r.t. u, T \ dom(cr), 
and SC. Then, there exists S C T \ dom(er) such that x = A (it, w, S) ^ w and x \= SC 
For each fluent f ^ Sit holds that Ff = Ff. Moreover, Ff = Ff holds too, because w 
satisfies C^ a . Hence, Dynf • — > Ff = Ff holds for all j. 

For each fluent /, since x is closed w.r.t. SC, we have that GJ j — > • (for all j 6 
{1, . . . , hf }). Observe that the conditions of the forms (TTTT>— (fT6b in the conjunct at line (fJTJ 
(i.e., in C^ x ) do not depend on the specific x. Then, the conjunct (fJTJ is satisfied. 

Let us also observe that condition (l22l i holds too. This is so because, for all / € dom(er) 
we have that Ff = Ff = FJ. From the fact that x ^ w it follows that V /e ^ Ff ^ Ff 
holds. Finally, the condition (l23l is satisfied because, whenever Ff ^ Ff holds, by the 
definition of A, it must be the case that Ff = Ff. It follows that Form(T>) u ' a is satisfiable 
(by x). 

This is a contradiction and proves that w is minimally closed w.r.t. u, T \ dom(cr), and 
SC, and that (it, a, w) is a valid transition. □ 

Let (P, O) be an instance of a planning problem, where 13 is a domain description and 
O contains any number of axioms of the form initially(C) and goal(C). We conclude 
this section by stating a generalization of Theorem[7]to the case of SC ^ 0. 

Theorem 10 

There is a valid trajectory (vq, oi, v\, a,2, ■ ■ ■ , o\, un) if and only if 

• wo H 

• There is a solution for the constraint 

N-l 

/\ C V0 A /\ C^ ,fl,+1 A /\ C" ,N 

initially(C?)eO J=0 goal(C)eO 

• For each j G {0, . . . , N — 1} the formula Form(T)) Vj - aj+1 is unsatisfiable. 

The result follows from Theorems [8] and [9] □ 
Remark 5 (Embedding ofB into Bq IV ) 

We conclude this section by showing that Bq IV is at least as expressive as B. To this aim 
it suffices to describe how to translate a domain description 23 of B to a SQ /y domain 
description 23', in such a way that the semantics of the domain is preserved. Let us outline 
the main points of such a translation. 

Each Boolean fluent / in 23 can be modeled in Bq IV by a multi-valued fluent /' whose 
domain is V = {0, 1} C Z. 

Each action in 23 uniquely corresponds to an action in 23'. 
Let us consider a dynamic causal law of 23, e.g., 



causes(a, /, [f 1: . . . , f k ,neq(gi), . . . , neg(gfc)]). 
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This law is translated in V as 

causes(a, /' = 1, [f{ = 1, . . . , f' k = 1, g{ = 0, . . .,g' h = 0]). 

In a similar manner, static laws and executability conditions of T> are mapped into Bq IV . 
Consequently, the two domain descriptions V and V describe two isomorphic transition 
systems. 

6.2 Adding annotated fluents and non-Markovian references 

In this section, we generalize the treatment described in Section 16.11 in order to provide 
a state-transition semantics for E ' v suitable to cope with temporal references. The first 
form of temporal references involves annotated fluents and concerns relative access to their 
past values, w.r.t. the current state. There is no restriction on the occurrences of this kind of 
annotated fluents: they might be used in all laws of a domain description. In this case, the 
extension of the semantics described in Section loTTI comes rather naturally. Since references 
may relate different points in time along the plan, the approach consists of considering 
sequences of states instead of pairs of states, to define the transition constraints. 

Regarding references to future points in time (i.e., positively annotated fluents), we recall 
that they are admitted in the consequences of dynamic causal laws only. This restriction 
allows the treatment of future and past references by exploiting the very same mechanisms. 
The semantics is further enriched in Section [6~4l to encompass state constraints specified 
by using absolute time references, as well as costs. 

Let v = (vo, . . . , i>n) be a state sequence. Given v, and i 6 {0, . . . , N}, we define the 
concept of value of <p in v at time i (with abuse of notation, denoted by v(i, <p)) as follows! 

v(i, x) — x if x G V 

v(i, f) = v i+j (f) if / e T, and s: i + j ^ N 
v(i,f j ) = v (f) if/e^andz+i<0 
v(i,p) = v N (f) iff eTandt+j >N 

v(i,abs(ip)) = \v(i,ip)\ 
v(i,-(<f)) = -(v(i,<p)) 
v(i, (pi © ip 2 ) = v(i, ipi) © v(i, ip 2 ) 
v(i,re±(C)) = 1 ifv\=i C 
v(i,rei(C)) =0 ifv ^ C 

where n e V, © G {+, *, /,mod}. 

As for ([Tol l of Section 16.11 the semantics of reified constraints relies on the notion of 
satisfaction, which in turn has to be contextualized to a specific point in time i. More 
formally, given a fluent constraint ipx opifi2 and a state sequence v, the notion of satisfaction 
at time i is defined as v |=,; ipi op</?2 <=> v(i 1 (fi) opv(i, Lp 2 ). The notion |=j is generalized 
to the case of propositional combinations of fluent constraints in the usual manner. 

Given a constraint C, let ^-fluents(C) be the set of annotated fluents for i ^ 0, 

8 A slightly simplified treatment could be described if only past references are admitted. In this case, we con- 
sider i to be the current point in time and j to be negative. The notation could then be simplified by considering 
just a prefix v = (vo, . . . , Vi) of the state sequence. 
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occurring in C. Given a state sequence v = (v , . . . , Vi), with ^ i < N, a function 
a : ^-fluents(C) — ► V is an i-solution of C w.r.t. v, if it holds that 

(v , ...,Vi, ine(cr| , Vi), (a\i), . . . , (cr| N _ (i+1) )) |= i+ i C, 

where each a\k (for fc ^ 0) is the restriction of the assignment a to the fluent annotated 
with k, and ~p denotes the substitution obtained by completing /x, with assignment to _L for 
all fluents not in dom(/x). Note that we treat the interpretation of the various operations as 
strict w.r.t. _L and we assume satisfied all constraints that refer to undefined expressions. 
Hence, for instance, if C is constraint and there is a sub-expression tp of C evaluated as _L, 
then we assume v \=i C. 

Example 13 

Let N = 3 and i = 1. Consider the constraint C = (g° = / _1 + ,/ -2 ) and let v = 
(v ,v 1 ) = ({f/2,g/l},{f/l,g/2}). 
Then a = {g/3} = a\o is a 1-solution of the constraint C, since 

. ine(a| ,{//l,ff/2}) = ine({ 3 /3},{//l, ff /2}) = {//l, 3 /3}, and 

. <{//2, g/1), {f/1, 5 /2}, {//l, 3 /3}, {f/±, g/±}, ) \= 2 g° = f- 1 + f~\ in fact, 

we have that v(2, C) is v(2, g°) = v(2, f~ 1 + f~ 2 ), which is equivalent to v 2 {g°) = 

vi(f) + v (f). 

□ 

A state sequence v = (vo, ■ ■ ■ , Vh) is closed w.r.t. a set of static causal laws 

SC = {caused(Ci, -Di), . . • , caused(Cfc, D^)} 

if for alH e {0, . . . , h} it holds that v \=i {d £>i) A • • • A (C k -> D k ). 

We also generalize the notion of minimal closure as follows: given a state sequence 
v = (vo, . . . , Vi) and a state v' we say that v' is minimally closed w.r.t. v, D, and SC if 

• (vo, ■ ■ ■ , Vi, v') is closed w.r.t. SC 

• for all sets of fluents S C D, if the state A(vj,t/, S 1 ) is different from w', then 
(«o, • • ■ , Vi, A(vi, v', S)) is not closed w.r.t. SC. 

The action a is executable in u at time i if there is an axiom executable(a, C) such 
that v \=i C. 

Let us denote with Dyn(a) the set of dynamic causal laws for an action a. The effects 
of executing a in v at time i, denoted by Eff(a, v, i), is 

Eff(a, v,i) = f\ {PC | causesfa, PC, C) G Dyn(a),v \=i C} 

Given a constraint C, we denote by s/ii/t* (C) the constraint obtained from C by replac- 
ing each fluent f x with J 2- '. 

Let us assume that v = (vq, . . . , V{) is a sequence of complete states and that a is a se- 
quence of actions (ai, . . . , a i+ i). The effects of the sequence of actions in v is represented 
by the formula 

i i 

E(i,d,v) = f\ shift^iEffiaj+uvJ)) A f\ f\ = Vj (f) 
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Let us observe that this constraint might involve all fluents of the states v , . . . , v i7 as 
well as fluents of future states. The values of fluents in states vo, . . . , are fixed by v. 

Let (D, O) be a planning problem instance, v = (vq , . . . , i>n) be a sequence of complete 
states and a\, . . . , a« be actions. Then, (vo, a\, v\, . . . , On, «n) is a valid trajectory if the 
following conditions hold: 

• (vo, . . . , vn) is closed w.r.t. <S£ 

• for each axiom of the form initial(C) in O, we have that v \= C 

• for each axiom of the form goal(C) in O, we have that v |=n C 

• for each i G {0, . . . , N — 1} the following conditions hold 

— action a i+1 is executable in v at time i and 

— we have that = ine(o-| , v i+ i) where 

- g is a z-solution of the constraint E(i, (a\, . . . ,a^}, (vo, ■ ■ ■ , «n-i)) w.r.t. 
(vo, ■ ■ -,Vi), 

- v i+ i is minimally closed w.r.t. (v 0} ■ ■ ■ , Vi), T \ dom(a), and SC. 

Example 14 

Let us consider the following domain specification and planning problem instance (for 
N = 2): 

f luent(/, 1, 5). 
f luent(<7, 1, 5). 
f luent(/l, 1, 5). 
action(a). 
action(6). 

executable(a, true). 
executable(6, true). 
causes(a, g = g^ 1 + 2, true). 
causes(6, /° = gr -1 + h~ 2 , true). 
initially(/ = 1). 
initially^ = 1). 
initially(/l < 3). 
goal(/ > 4). 
Observe that the only valid trajectory is 

({//l, h/2}, a, {f/1, 5 /3, h/2}, b, {//5, g/3, h/2}). 

The validity can be verified by observing that: 

• {f/1, .g/l, h/2} satisfies all the constraints provided in the initial declarations; 

• {//5, 5/3, h/2} satisfies the goal constraint / > 4; 

• the action a is executable in {v ) = ({f/1, g/l, h/2}) and action b is executable in 

(vo,vi) = ({f/l,g/l,h/2},{f/l,g/3,h/2}) 

(since both their executability laws and the action conditions are trivially true). 

• Consider the first state transition and i — and note that ^-fluents(5° = g^ 1 + 2) = 
{5}. Then, a' = {g/3} is a 0-solution of g° = g' 1 + 2 w.r.t. ({f /1, g/l, h/2}). In 
fact, ct'|o = a' , g'\\ = {}, and 
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— v x = ine(aV v ) = ine({. 9 /3}, {f/1, g/1, h/2}) = {f/1, g/3, h/2} 

— («b,«iX|i) = {v ,v 1 ,{f/±,g/±,h/±}) hi .9° =9~ l + 2. 

— «i is minimally closed w.r.t. {{f/1, g/1, h/2}), {/, h} and 0. 

• Consider the second state transition and i = 1 and note that ^-fluents(/° = g _1 + 
h ~ 2 ) = {/}■ Then < = {//5} is a 1-solution of /° = g" 1 + /i~ 2 w.r.t. (w ,Vi). 
In fact, a"\ = a", and 

— «a = ine(<r"| 0) «i) = ine({//5}, {//l, g/3, ft/2}) = {//5, g/3, h/2} 

— {v ,v u v 2 ) \= 2 f° = g' 1 + h- 2 . 

— V2 is minimally closed w.r.t. {vq, v%), {g, h} and 0. 



□ 



6.3 Abstract implementation of B 

The constraint encoding for B MV is similar to the one developed earlier for the case of 
Bq IV (cf., Figures [51 and ITOb. In the encoding of a trajectory {vq,o,i,vi, . . . ,cin>i>n) in 
Bq IV , we introduced a variable i 7 ^ to represent the value of the fluent / in the i th state Uj. 
In each state transition, say from Uj to u^+i, the implementation of Bq /v imposes only 
constraints involving the variables/fluents of the current state. In the language encompass- 
ing timed references, each constraint occurring in the action description can address the 
values that fluents assume in any of the states of the sequence v = {vq, . . . , «n). Since 
all the variables representing these values are present in the encoding, only the following 
change is needed to adapt to B MV the implementation designed for Bq IV : to obtain from a 
constraint C (involving fluents), a constraint C va (involving the corresponding variables), 
at time i, we replace each / 3 with the variable F v ^ +J . 

By adopting this refined construction for C v ' % , we can inherit all the results of Sec- 
tion l6.1.4l In particular, for an action description V, similarly to what done in Section l6.1.4l 
we denote by C^ a ' and by Form{'D) V:ai the constraints homologous to C^ -1 ' * and 
Form{V) Vi - 1 ' ai , respectively. 

The completeness result for B MV directly generalizes that obtained for Bq IV . With 
regards to soundness, the observation made w.r.t. Bq IV in Section 16.1.41 still applies. In 
fact, let {V, O) be an instance of a planning problem where V is a domain description and 
O contains axioms of the form initially(C) and goal(C). We state the following: 

Theorem 11 

There is a valid trajectory v — {vq, a\, v\, . . . , Vjss, Un) if and only if 

• vis closed w.r.t. S£ 

• There is a solution for the constraint 

N-l 

/\ C"<° A /\ C^ +1 A /\ C*< N 

initial ly(C)e0 j=0 goal(C)GO 

• For each j G {0, . . . , N — 1} the formula Form{V) v ' aj+1 is unsatisfiable. 
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6.4 Adding costs and global constraints 

Cost and time constraints can be introduced by filtering the solutions characterized by The- 
oremQTl in order to rule out the unsatisfactory solutions. More precisely, given a trajectory 
(vq, <Zi, Vi, . . . , bn, «n) satisfying the requirements of Theorem [TT] we say that the tra- 
jectory satisfies a set of global constraints as described in Sections 15.11 and 15.21 if all the 
constraints described next hold. 

Let us start by investigating the cost constraints. Let 

action_cost(ai, FE\), . . . , action_cost(aN, FEj^) 

and state_cost(i 7 'i?') be specified in the action description^ 

Let us recall that the general form of cost constraints is cost_constraint(C), where 
C is a constraint defined as in Section|5] with the added ability to refer to the atoms plan, 
goal, and state (i) wherever fluents can be used. Consequently, we extend our definition 
of value of an expression ip in v = (vo, . . . , i>n) at time i (for all j): 

v(j, plan) = wo(^i) + • • • + %-i(F£ N ) 

v(j,qoal) = V N (FE') 

v(j, state (i)) =Vi(FE') ifO^i^N 

(assigning cost constraints to to states outside the plan is senseless. However, for complete- 
ness, for i < or i > N we set v(j, state (i) ) = but any other choice — e.g., ±, or the 
values on states or N — is reasonable). This modification allows us to derive the notion 
of satisfaction of a cost constraint C from the notion of satisfaction defined in Section ROI 
As particular cases, we obtain that: 

• for each assertion cost_constraint(plan op k) the plan cost (vo{FEi) + • • • + 
un-i(-F-En)) has to satisfy the stated constraint, i.e., it must hold that (vq(FEi) + 
hv N -i(f%))op fc; 

• for each assertion cost_constraint(goal op k), the cost vn{FE') of the goal 
state must satisfy the constraint: v-^(FE') op fc; 

• for each assertion cost_constraint(state (i) op k), the cost Vi(FE') assigned 
to the i th state has to satisfy the constraint Vi(FE') op k. 

The handling of time constraints requires the following modifications: 

• for each assertion time_constraint(C), it holds that (vo, . . . , «n) |=o C, where 
each timed fluent /@z is evaluated as Vi(f); 

• for each assertion of the form holds(C, i) it holds that (vo, . . . , vjss) \=i C; 

• for each assertion of the form always(C), it holds that (v , . . . , %) \=i C for all 

ze{0,...,N}. 

Moreover, if minimize_cost(F£'") is specified, then there exists no other trajectory v' 
such that w'(N, FE") < w(N, FE"). As particular cases, we have that 

• if minimize_cost(plan) is specified, then there exists no other trajectory having a 
smaller plan cost; 

9 As mentioned, if some of these assertion is missing a default cost 1 is assumed. 
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• if minimi ze.cost(goal) is specified in the action description, then there is no tra- 
jectory (v' , a'^v'i, . . . , a^, v' N ), fulfilling all constraints, and such that v' N (FE') < 
vk(FE'). 

In this manner, we characterize the solutions of a given planning problem to be exactly 
those solutions described by Theorem[TT|that additionally satisfy all the global constraints, 
the requirements on costs, and the time constraints expressed in the action description. 
Soundness and completeness properties directly carry over. 

7 Concrete Implementation of B MV 

The overall structure of the concrete implementation of the language B follows that 
used for implementing the B language. We focus here on the main differences. 

To start, let us briefly describe the code depicted in Figure HTIan d show that this concrete 
implementation reflects the abstract one defined in Figure [8l 10 l Hence, we preliminarily 
ignore lines d65ll-(l66l> of Figure [TT] 

The first difference w.r.t. the implementation of B (cf., Section is that each fluent 
variable is assigned to a finite set domain, drawn from the fluent declaration — instead of 
being treated as a Boolean variable. 

The predicate set_one_f luent (lines d56l)-(l68i>) has a similar role as in the implemen- 
tation of B. Given the fluent FluentName, the relevant parts of the dynamic causal laws 
are collected in lines (T57T>— (T59b . The predicate zero.subterm is an auxiliary predicate that 
detects if a constraint involves a fluent — i.e., it looks for an occurrence of FluentName in 
the constraint imposed by the dynamic causal laws. All the fluents explicitly involved in 
the consequence of a dynamic law are collected. In line do*3T >. the variable ev identifying 
the fluent FluentName in the following state ToState is retrieved. 

The predicate dynamic (line d64l i) collects the list of Boolean flags DynFormula. If one 
of the variables in Dyn is true then the variable ev is involved in a constraint imposed by 
a dynamic causal law. In line d67l > the disjunction of these flag variables is computed in 
Formula (let us ignore, for the time being, the variable StatFormula). In line d68l > the 
inertia constraint is added: if Formula is false then the value of the fluent is left unchanged 
by the transition (i.e., iv = ev). This corresponds to the ine(-) operator. 

For each action Act affecting the value ev, the predicate dynamic (lines d69li-(l82li) re- 
trieves its preconditions and builds the constraint c involving ev that must be imposed if 
the preconditions are satisfied. The flag variable Flag in line (f80b is introduced to keep 
track of the fact that the action has occurred (i.e., va is true) and the corresponding precon- 
dition holds. If Flag is true then the constrain c is asserted (line dHTli). All flags are stored 
in a list (cf., the variable DynFormula in line d64li). 

Lines d83li- (ll 14b provide an excerpt of the definition of the predicate rel.parsing. 
This predicate is used to transform fluent expressions to internal expressions involving 
fluent variables, states is a list of states (each of them, in turn is a list of all the fluent 
variables). The first argument is the fluent expression and the second one is the output 

10 Observe that the concrete implementation uses the functors eq, neq, etc. to denote the primitive constraints 
=, 5^, etc. 
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(56) set_one_f luent ( fluent (FluentName, IV) , ActionOccs, Now, States) :- 

(57) f indall ( [Act, OP, FE1, FE2, L] , 

(58) (causes (Act, FC, L) , zero.subterm (FluentName, FC) , 

(59) FC =.. [OP, FE1, FE2 ] ) , Dyn), 

(60) state.select (Now, States, FromState) , 

(61) Next Is Now+1, 

(62) state.select (Next, States, ToState) , 

(63) member ( fluent (FluentName, EV) , ToState), 

(64) dynamic (Dyn, ActionOccs, FromState, DynFormula, Next, States), 

(65) cluster.rules (FluentName, Stat), %%% These 2 lines can be dropped in 

(66) static (Stat, States, Next, StatFormula) , %%% absence of static laws 

(67) bool.dis j (DynFormula, StatFormula, Formula), 

(68) #\ Formula #=> EV #= IV. 

(69) dynamic ([],.,., [],.,.). 

(70) dynamic ( [ [Act, OP, FE1, FE2, Prec] Rest] ,AOccs, State, [FlaglPFl] ,Now, States) 

(71) member (action (Act, VA) , AOccs), 

(72) Last is Now-1, %%% Looks for preconditions in FromState and before 

(73) get_precondition_vars (Last, Prec, States, ListPV) , 

(74) length (Prec, NPrec) , 

(75) sum(ListPV, SumPrec) , 

(76) %%% The effect is in the next state (Now=Last+l) 

(77) rel.parsing (FE1, Vail, Now, States), 

(78) rel_parsing(FE2, Val2, Now, States), 

(79) exp.constraint (Vail, OP, Val2, C) , 

(80) (VA #/\ (SumPrec #= NPrec)) #<=> Flag, 

(81) Flag #=> 0, 

(82) dynamic (Rest, ActionOccs, State, PF1, Now, States). 

(83) rel.parsing (Num, Num, _, _} :- 

(84) integer (Num) , ! . 

(85) rel.parsing (rei (RC) , Val, Time, States) :- 

(86) RC =. . [OP, El, E2] , 

(87) rel.parsing (El, Vail, Time, States), 

(88) rel.parsing (E2, Val2, Time, States), 

(89) exp.constraint (Vail, OP, Val2, Val), !. 

(90) rel.parsing (abs (FE) , Val, Time, States) :- %%% similar for - (FE) 

(91) rel.parsing (FE, Vail, Time, States), 

(92) Val #= abs (Vail) , ! . 

(93) rel.parsing (FE, Val, Time, States) :- 

(94) FE =. . [0P,FE1,FE2] , 

(95) member(OP, [ +, -, mod, / , * ] ) , 

(96) rel.parsing (FE1, Vail, Time, States), 

(97) rel.parsing(FE2, Val2, Time, States), 

(98) ( OP = + -> Val #= Vail + Val2; 

(99) OP = - -> Val #= Vail - Val2; 

(100) OP = * -> Val #= Vail * Val2; 

(101) OP = / -> Val #= Vail / Val2; 

(102) OP = mod -> Val #= Vail mod Val2 ), !. 

(103) rel.parsing (Fluent "Delta, Val, Time, States) :- 

(104) H is Time+Delta, 

(105) length (States, N) , 

(106) in.interval (H,N,E) , 

(107) state.select (E, States, State), 

(108) member (fluent (Fluent, Val) , State) , ! . 

(109) rel.parsing (Fluent @ Time, Val, _, States) :- 

(110) state.select (Time, States, State) , 

(111) member (fluent (Fluent, Val) , State) , !. 

(112) rel.parsing (Fluent, Val, Time, States) :- 

(113) state.select (Time, States, State), 

(114) member (fluent (Fluent, Val) , State). 

(115) parsing (Fluent, Val, State) :- 

(116) rel.parsing (Fluent, Val, 0, [State]). 

(117) exp.constraint (L, OP, R, C) :- 

(118) (OP == eg -> C #<=> L #= R; 

(119) OP == neg -> C #<=> L #\= R; 

(120) OP == geg -> C #<=> L #>= R; 

(121) OP == leg -> C #<=> L #=< R; 

(122) OP == gt -> C #<=> L #> R; 

(123) OP == It -> C #<=> L #< R) . 

Fig. 1 1 . Relevant parts of the B MV implementation. 
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internal expression. The argument Time represents the specific point in time in which a 
fluent is referred to (cf., the variable Now used in lines d69i>-d82l> and (1 1 24-b — d 1 34-b to specify 
the precise point in time in which a fluent expression/constraint has to be evaluated). The 
predicate in_interval called in line ( 1106b sets E = HifO<H<N, E = (resp., 
E = N) if H < (resp., H > N). Similarly, predicate exp.cons traint (lines ( II 17M 1231 
transforms fluent constraints into the corresponding constraints on the fluent variables. 

The above described fragment of implementation is completed with the code needed to 
handle initial and goal state specifications. Namely, for a specific instance of a planning 
problem (T>, O), as done for £>, all constraint on the initial state (resp., those on the goal 
state) are reflected by constraining the variables Ff in the representation of the initial (resp., 
final) state. 

We proceed by splitting the correctness proof into steps. We can now state the following 



Theorem 12 

The concrete implementation (partially depicted in FigurefTTTl is correct and complete w.r.t. 
the system of constraints of Figure [8] 



This result immediately follows from the above argument. In fact, the constraint (fTTT i of 
Figure[8]is implicitly rendered by domain assignment for CLP variables. Constraints (fT3l 
and (TBI are dealt with in lines (I57li-(|64"1). Line d68l i imposes constraint ( fT5l l. Concerning 
the sequentiality of the plan and the executability conditions (i.e., constraint (O), we can 
observe that the implementation does not differ from that of B (in Figure [TT] we omitted 
the corresponding code, see Figure |7). □ 

(124) static ([],_,-, [] ) . 

(125) static ([ [OP, FE1, FE2, Cond] I Others] , States, Now, [Flag | Flags ] ) :- 

(126) get_precondition_vars (Now, Cond, States, List), 

(127) length (List, NL) , 

(128) sum(List, Result), 

(129) rel.parsing (FE1, Vail, Now, States), 

(130) rel.parsing (FE2, Val2, Now, States), 

(131) exp.constraint (Vail, OP, Val2, C) , 

(132) (Result #= NL) #<=> Flag, 

(133) Flag #=> C, 

(134) static (Others, States, Now, Flags). 

Fig. 12. Static causal laws treatment 

Let us now consider the presence of static causal laws. In Figure[l2] we list the predicate 
used to add constraints for the static causal laws. Notice that the concrete implementation 
of FigureQ~2]contains a discrepancy with respect to the abstract one of Figure[l0] In partic- 
ular, the concrete implementation does not deal with an intermediate state (named v in the 
abstract implementation). The fluents of the target state are computed by exploiting direct 
relationships with the starting state of the transition. This allows us to introduce fewer CLP 
variables. 

11 When establishing completeness an soundness results for the concrete implementation, we assume the same 
properties hold for the real implementation of the CLP(FD) solver at hand (in our case, SICStus Prolog). 
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In line (165b of FigureQT|the predicate cluster.rules collects all the (static) conditions 
imposed on the fluents of the cluster of FluentName. The call to the predicates static 
(line (166b ) collects the list of Boolean flags StatFormula which are used to model the 
constraints ( fT9] l and < f20b of Figure [10] In line d67l i. the disjunction of these flag variables, 
together with those originating from the dynamic causal laws (i.e., DynFormula), is com- 
puted in Formula, as explained above. 

For each condition implied by a static causal law, the predicate static (lines ( 1124b - 
(1134b ) builds the constraint c that must be imposed to ensure closure. The flag variable 
Flag in line (1132b is introduced to reflect the satisfaction of the constraint. If Flag is true 
then the constrain c is asserted (line ( 1133b ). All such flags are stored in the list Flags (cf., 
the variable StatFormula). 

We have the following result: 

Theorem 13 

The concrete implementation (partially depicted in Figures [TT] and [TZi l is complete w.r.t. 
the system of constraints of Figures [8] and [10] 

Proof 

The result directly follows from the above argument. Constraint (fT6b of Figure [10] is im- 
plicitly rendered by the domain assignment for the CLP variables (let us remember that the 
intermediate state v is not explicit in the concrete implementation). Constraints (TTTb — (TT3T> 
are dealt with as done in Theorem [12] The conditions originating from the static causal 
laws are dealt with through the predicates cluster.rules and static. □ 

Let us observe that there is a second difference between the concrete implementation of 
Figures [TT]and[T2] and the abstract one of Figure[lOl no requirements for the unsatisfiability 
of Form(V) v ' ai are imposed in correspondence of the state transition from to Vi (for 
any i). This allows the generation of state transitions where the target state is potentially 
not minimally closed. This means that the concrete implementation may produce solutions 
(i.e., plans) that the abstract semantics would forbid because of the non-minimal effects of 
(clusters of) static causal laws. On the other hand, we reflect constraints ( TT~9b and (l20b as 
described earlier, through the predicates static (listed in Figure [12b and cluster.rules 
(whose obvious code is omitted). 

The final step in the design of the concrete implementation is the introduction of suitable 
restrictions on the labeling phase of the CLP solver. Notice that, if at step i in a trajectory, 
a consequence of a dynamic law involves a fluent /•? , for j > i, then such a constraint 
has to be evaluated considering as already assessed all the states Vh preceding m. Hence, 
the labeling has to proceed "left-to-right" w.r.t. the CLP variables that model the states 
vi, . . . ,Vi. In other words, when searching for a solution, the variables representing the 
state Vh have to be labeled before those representing the state Vh+i, for each Vh in the 
trajectory. The implementation of this labeling strategy is depicted in FigureQj] Moreover, 
observe that we impose further restrictions (through the predicate no_loop in lines ( 1147b - 
( 11551 )) to avoid loops in plans, i.e., to forbid those trajectories where the same state appears 
twice. 

To complete the implementation of B MV we need to take care of the cost-based con- 
straints, whose behavior relies on the optimization features offered by SICStus' labeling 
predicate: the labeling phase is guided by an objective function to be optimized. 
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(135) lm_labeling (Actionsocc, States) :- 

(136) lm_labeling (Actionsocc, States, 1) . 

(137) lm.labeling( [] , .) :- !. 

(138) lm_labeling ( [CurrAct I Actions ] , States, I) 

(139) lm_labeling_aux (CurrAct ) , 

(140) no.loop (States, I), 

(141) II is 1+1, 

(142) lm_labeling (Actions, States, II). 

(143) lm.labeling.aux ( [ ] ) . 

(144) lm_labeling_aux ( [action (_, A) I R] ) :- 

(145) indomain (A) , 

(146) lm_labeling_aux (R) . 

(147) no.loop (States, A) :- 

(148) state.select (A, States, StateA) , 

(149) no.loop(A, States, StateA). 

(150) no.loop (0, ., .) : - ! . 

(151) no.loop (B, States, StateA) :- 

(152) Bl is B-l, 

(153) state.select (Bl, States, StateB) , 

(154) StateA \== StateB, 

(155) no.loop(Bl, States, StateA). 



Fig. 13. Implementation of a leftmost labeling strategy. 



Constraints on costs, as well as absolute temporal constraints, are handled by asserting 
suitable CLP constraints on the variables that model fluent values. This is realized through 
the predicates listed in Figure[l4] In particular, set.cost.constraints deals with con- 
straints on actions/plans and states. For instance, set_statecosts (line 1 167b retrieves 
all the assertions of the form cost-constraint (state (I) OP Num) and imposes the 
corresponding constraints. A similar predicate set.goal (not reported in the figure) ac- 
complishes the same for the final state only. The predicate set.plancost acts similarly, 
using the predicate make_one_action_occurrences (lines ( |192t -( [T93l ) where the cost 
for each single action is considered. 

All the absolute temporal constraints defined in the action description are handled by 
the predicate set.time.constraint (cf., lines ( I194t - d202l l). Also in this case, direct ref- 
erences to CLP variables implement the references to fluent expressions in any absolute 
point in time. 

As mentioned, all these constraints can be seen as filters used to validate each trajec- 
tory found by the labeling phase. The planner described in Figures ITTT4T3I is completed 
by adding the code in Figure [14] Completeness of the implementation of the full B MV 
immediately follows from the above discussion. 



8 Experimental Analysis 

We implemented CLP-based prototypes of B and B MV . These have been realized in SIC- 
Stus Prolog 4, and they have been developed on an AMD Opteron 2.2GHz Linux machine. 
Extensive testing has been performed to validate our CLP-based approach. Here we con- 
centrate on a few representative examples. The source code of the implementations and the 
examples can be found at lwww . dimi .uniud. it/dovier/CLPASP[ No particular built-in 
predicates of SICStus have been used and therefore porting to other CLP-based Prolog 
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156) set_cost_constraints (States, PlanCost, GOALCOST) :- 

157) set.goalcost (States, GOALCOST), 

158) set.plancost (PlanCost ) , 

159) set.statecosts (States) . 

160) set.plancost (PC) :- 

161) f indall ( [OP, Num] , (cost .constraint (C) , C = .. [OP, plan, Num] ) , PlanCosts), 

162) set_plancost_aux (PlanCosts, PC) . 

163) set_plancost_aux ([],_) . 

164) set.plancost.aux ([ [OP, Num] I PlanCosts] , PC) :- 

165) add.constraint (PC, OP, Num) , 

166) set_plancost_aux (PlanCosts, PC) . 

167) set.statecosts (States) :- 

168) findalM [I, OP, N] , (cost-constraint (C) , C = . . [OP , st ate ( I ) , N] ) , Costs), 

169) set_statecost_aux (Costs, States) . 

170) set_statecost_aux ([],_} . 

171) set_statecost_aux ( [ [I, OP, Num] I StateCosts] , States) :- 

172) (state.cost (FE) , ! ; FE = 1), 

173) rel.parsing (FE, Val, I, States) , 

174) add.constraint (Val, OP, Num) , 

175) set_statecost_aux (StateCosts, States) . 

176) make_action_occs (N, ActionsOcc, PlanCost, Na) :- 

177) setof (A, action(A), La), 

178) length (La, Na) , 

179) make_action_occurrences (N, La, ActionsOcc, PlanCost). 

180) make_action_occurrences ( 1 , _, [], 0} . 

181) make_action_occurrences (N, List, [ Act I Act ionsOcc ] , Cost) :- 

182) Nl is N-l, 

183) make_action_occurrences (Nl, List, ActionsOcc, Costl), 

184) make_one_action_occurrences (List, Act, Cost2), 

185) get_action_list (Act, AList), 

186) fd_only_one (AList ) , 

187) Cost #= Costl+Cost2. 

188) make_one_action_occurrences ( [ ] , [], 0). 

189) make_one_action_occurrences ( [A I Actions ] , [action (A, OccA) lOccActs], Cost) :- 

190) make_one_action_occurrences (Actions, OccActs, Costl), 

191) f d_domain_bool (OccA) , 

192) (action.cost (A, CA) , ! ; CA = 1), %%%Default action cost = 1 

193) Cost #= OccA*CA+Costl . 

194) set_time_constraints (States) :- 

195) f indall ( [FE1, OP, FE2] , ( t ime.const raint (C) , C = . . [OP , FE1 , FE2 ] ) , TimeCs), 

196) set_time_constraints (TimeCs, States). 

197) set_time_constraints ( [ ] , _) . 

198) set_t ime.const raint s ( [ [FE1 , OP , FE2 ] I Rest ] , States) :- 

199) rel.parsing (FE1, Vail, ., States), 

200) rel_parsing(FE2, Val2, _, States), 

201) add.constraint (Vail, OP, Val2), 

202) set_time_constraints (Rest, States). 

203) add.constraint (L, OP, R) :- 

204) exp.constraint (L, OP, R, 1). 



Fig. 14. Handling of global constraints and costs. 



systems is straightforward. A porting to B-Prolog has been realized and used to participate 
in the 2009 ASP Competition^! 

In the rest of this section, we analyze the performance of the implementation on a di- 
verse set of benchmarks. For each benchmark, we compare a natural encoding using the 
traditional B language with an encoding using B M . 

The problems encoded in B have been solved using both the CLP(FD) implementation 



See the web site http : / /www . cs ■ kuleuven . be / ~dtai/ event s/ASP- competition/ Teams/Bpsolver-CLPFD ■ shtml| 



46 



A. Dovier, A. Formisano, E. Pontelli 



and implementations obtained by mapping the problem to ASP and using different ASP 
solvers (Smodels, Clasp, and Cmodels with different SAT-solvers). 

In order to solve a S-planning problem (T>, O) using an ASP solver, we have developed 
a Prolog translator that takes as input (T>, O) and the plan length n, and it generates an 
ASP program, whose stable models are in one-to-one correspondence with the plans of 
length n for (T>, O). This encoding follows the general ideas outlined in (Lifs chitz 1999K 
In particular, the definitions of fluent, action, and initially are already in ASP 
syntax. The length of the plan n is used to define the predicate t ime ( . . n ) . The ASP- 
based planner makes use of a choice rule to ensure that exactly one action is applied at 
each time step: 

l{occ (Act, Ti) : action (Act) }1 :- time(Ti), Ti < n. 
The predicate hold (Fluent, Time) defines the truth value of a fluent Fluent at a given 
time step (Time). The truth value of the fluents at time are given as facts describing the 
initial state; we require the initial state to be complete. The executability rules, the dynamic 
causal laws and the static causal laws are instantiated for each admissible time step. Finally, 
the goal conditions are added to define the predicate goal; the requirement that the goal 
has to be satisfied at the end of the plan is imposed using an ASP constraint of the form 

: - not goal. 

As far as the CLP-based implementations are concerned, we use a leftmost variable 
selection strategy. Moreover, we included a loop control feature to avoid the repetition of 
the same state in a trajectory (cf., the predicate no_loop in Figure [T3ll. 

Tables Q~|-[5] discussed in detail in the next subsections, illustrate an excerpt of the ex- 
perimental results. In order to simplify the comparison among the solvers, in each table 
we introduce an extra column, denoted by "Best ASP," which indicates the performance 
of an hypothetical ASP-solver that always acts as the best between all the ASP-solvers 
considered. 

The specific meaning of the various columns is as follows: 

• Instance: the name of the specific instance of the problem 

• Length: the plan length used in searching for a solution 

• Answer: indication of whether an answer exists or not for the given plan length 

• Iparse: the time required to ground the ASP encoding of the problem (using lparse 
1.1.1) 

• Smodels: the execution time using the Smodels system (using Smodels 2.32) 

• Cmodels: the execution time using the Cmodels system (using Cmodels 3.70 with 
different SAT solvers) 

• Clasp: the execution time using the Clasp system (using Clasp 1.0.2) 

• Best ASP: a summary of the best execution time across all the different ASP solvers 

• CLP(FD): the execution time using the CLP(FD)-based implementation of B. Exe- 
cution times have the form ti -K2, where t\ is the time needed for posting constraints 
and t 2 the time for solving the constraints (i.e., finding a plan) 

• B MV : the execution time using the B MV encoding of the problem. The first column 
is related to computations where no constraints for the plan cost are imposed. In- 
stead, the computations of the second column have a constraint that limits the plan 
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cost to the number in parenthesis. The format is t\ + t<x as explained in the previous 
point. 

In the remaining subsections we briefly describe the benchmarks tested and the obtained 
results. The actual encoding in B and B have been placed in the Appendix for the 
sake of readability. A summary and a discussion of all the experiments is presented in 
Section[8^6l 

8.1 Three-barrel Problem 

We experimented with different encodings of the three-barrel problem. Our formulation is 
as described in Example Q] Figure Q] and Section I A. 1 1 show the encoding of the problem 
(for N = 12) in B and in B MV , respectively. Notice that, in order to represent each multi- 
valued fluent / of the B MV formulation, a number of Boolean fluents have to be introduced 
in the B encoding, one for each admissible value of /. 

TableQ]provides the execution times (in seconds) for different values of N and different 
plan lengths. The results show that the constraint-based encoding of B outperforms the 
ASP encodings (if we consider both grounding and execution). In turn, the B MV encod- 
ing outperforms all other encodings. This can be explained by considering that the CLP 
encoding of this problem benefits from numerical fluents (in reduced number, w.r.t. the B 
formulation) and from arithmetic constraints (efficiently handled by CLP(FD)). 

8.2 2-Dimensional Protein Folding Problem 

The problem we have encoded is a simplification of the protein structure folding problem. 
The input is a chain a\ot2 • • ■ «n with cti S {0, 1}, initially placed in a vertical position, as 
in Figure [T3T-left. We will refer to each on as an amino acid. The permissible actions are 
the counter-clockwise/clockwise pivot moves. Once one point i of the chain is selected, the 
points ai, a 2, ■ ■ ■ , oci will remain fixed, while the points aj+i, . . . ,a n will perform a rigid 
counter-clockwise/clockwise rotation. Each conformation must be a self-avoiding-walk, 
i.e., no two amino acids are in the same position. Moreover, the chain cannot be broken — 
i.e., two consecutive amino acids are always at points at distance 1 (i.e., in contact). The 
goal is to perform a sequence of pivot moves leading to a configuration where at least k 
non-consecutive amino acids of value 1 are in contact. Figure Q3] shows a possible plan to 
reach a configuration with 4 contacts. Table [2] reports some execution times. Section lA2l 
reports the B action description encoding this problem. Since the goal is based on the 
notion of cost of a given state, for which reified constraints are used extensively, a direct 
encoding in B does not seem to be feasible. 

Let us consider the resolution of the instance depicted in Figure [TBI i.e., the folding of 
the input chain 1001001001 of n = 10 amino acids. Asking for a plan of 8 (resp. 10) moves 
and for a solution with cost ^ 4, our planner finds the 8-moves plan shown in Figure Oo- 
center in 50.46s (a 10-moves plan in found in 603.37s). By removing the two constraints 
that keep fixed a^: 
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Table 1. Experimental results with various instances of the three-barrel problem (timeout 24000sec). 
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Fig. 15. An instance of the HP-protein folding problem: initial configuration, a plan, and 
final configuration with 4 contacts between 1 -amino acids. 
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Table 2. The HP-protein folding problem: some results for different sequences, and plan 
lengths (timeout 12000sec). 



the solutions are found in 52.72s and 617.68s, respectively. On the other hand, by keeping 
fixed ct2 and adding the two constraints 

holds (x (3) eq 11,1). 
holds (y (3) eq 11,1). 

the execution time is reduced to 4.06s and 52.97s. Adding the additional constraints 

holds (x (4) eq 11,2). 
holds (y (4) eq 10,2). 

the plans are found in only 0.37s and 4.62s. This shows that the use of multi-valued fluents 
and the ability to exploit domain-specific knowledge, in the form of symmetry-breaking 
constraints, allows B MV to effectively converge to a solution. 



8.3 The Community Problem 

The Community problem is formulated as follows. There are M individuals, identified by 
the numbers 1,2,..., M. At each time step, one of them, say j, gives exactly j dollars 
to someone else, provided she/he owns more than j dollars. Nobody can give away all of 
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her/his money. The goal consists of reaching a state in which all the participants have the 
same amount of money. 

Table [3] lists some results for four variants of the problem: the person i initially owns 
2 * i dollars (instances Am), i + 1 dollars (instances Bm), i*i dollars (instances Cm), or 
i * (1 + i) dollars (instances Dm)- 

The representations of this problem are reported in Sections [A.3.1l and lA.3.2l 
Notice that the large number of Boolean fluents that have to be introduced in the B de- 
scription causes failures due to lack of memory during the grounding phase (these instances 
are marked "mem" in TableO. For all these experiments, the bound on memory usage was 
4 GB (for the grounder, the ASP-solvers, and the CLP(FD) engine). Observe that, in some 
cases, also the CLP(FD)-based solver for B runs out of memory, while the failures of the 
CLP(FD) solver for B MV have been caused by expiration of the time limit. In summary, 
the constraint-based encodings provides better performance in most of the instances, es- 
pecially considering their better scalability w.r.t. the size of the instances. This originates 
from the smaller number of numerical fluents and from the efficiency of the underlying 
constraint solver. 
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Fig. 16. On the left: a simple schema of the 1 1 rooms for the Gas-diffusion problem. The 
locked gates are in red color. The gas (in pink) is flowing through the open gate (in green), 
from Roomy to Roomi. On the right: some results for different instances (i.e., different 
goals and initial allocations of amounts of gas — see Section [84] >. 
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Table 3. Experimental results for instances of the Community problem, "mem" denotes out-of-memory failures. Some results are missing for the 
ASP solvers, for those instances that are unable to complete grounding. 



52 



A. Dovier, A. Formisano, E. Pontelli 



8.4 The Gas-diffusion Problem 

The Gas-diffusion problem can be formulated as follows. A building contains a number 
of rooms. Each room is connected to (some) other rooms via gates. Initially, all gates 
are closed and some of the rooms contain a quantity of gas — while the other rooms are 
empty. Each gate can be opened or closed — open (x, y) and close (x, y) are the only 
possible actions, provided that there is a gate between room x and room y. When a gate 
between two rooms is open, the gas contained in these rooms flows through the gate. The 
gas diffusion continues until the pressure reaches an equilibrium. The only condition to be 
always satisfied is that a gate in a room can be opened only if all the other gates are closed. 
The goal is to move a desired quantity of gas to one specified room. 

We experimented with instances of the problem where the building has a specific topol- 
ogy: there are eleven rooms, all having the same physical volume. Each room is connected 
to the other rooms via gates as depicted in Figure [16] Since all rooms have the same vol- 
ume, when equilibrium is reached between two rooms sharing an open gate, they will both 
contain the same amount of gas. 

A B MV specification of this planning problem is given in Section lA~6l We experimented 
with different instances of the Gas-diffusion problem obtained by considering different 
goal states and by requiring that some of the rooms have to be kept empty. Moreover, we 
seek plans of different length. Figure[l6](on the right) summarizes the results obtained. In 
particular, all instances share the same initial state: rooms 10 and 3 contain 128 moles of 
gas. All the other rooms are empty. Moreover, 

• in the instance Ai the goal state is: room 1 contains at least 32 moles of gas; 

• in all the instances the goal is: room 1 contains at least 50 moles of gas. The 
instances differ in the constraints imposed on the desired plan: 

— in the instance Bi, rooms 7, 9, and 4 must remain empty. This condition can 
be imposed by including in the action description the constraints 

always ( contains ( 7 ) eq 0) . 
always ( contains ( 9 ) eq 0) . 
always ( contains ( 4 ) eq 0) . 

— in the instance B2, rooms 7, 8, and 5 must be kept empty. 

— in the instance B3, only room 6 must be kept empty. 

— in the instance B4, no constraint is imposed. 

Observe that it is quite natural to design a B encoding of this problem, by exploiting 
the multi-valued fluents. On the other hand, adopting the naive approach used for the three- 
barrel problem would force the introduction of (at least) 128 distinct boolean fluents for 
each multi-valued fluent. Such a large number of boolean fluents generates a large state 
space, making the task of any solver for B considerably harder. 

8.5 Other Puzzles 



We report results from two other planning problems. The first — 3x3-puzzle — is an encod- 
ing of the 8-tile puzzle problem, where the goal is to find a sequence of moves to re-order 



Multi-valued Action Languages in CLP(FD) 



53 



the 8 tiles, starting from a random initial position. The performance results for this puzzle 
are reported in Table [4] The second problem is the well-known Wolf-goat-cabbage prob- 
lem. The performance results are reported in Table [5] 

Notice that these planning problems are predominantly Boolean. The constraint-based 
encodings perform well in solving the instances of the Wolf-goat-cabbage problem. In 
contrast, for the 8 -tile puzzle problem, the use of numerical fluents allows us to achieve a 
compact encoding, but it does not necessarily lead to a better performance w.r.t. ASP. 

8.6 A Summary of the Experiments 

Table |6]pictorially summarizes some of the results relating the performance of the different 
approaches. For each problem instance, we compare the execution times obtained by the 
best ASP-solver and the CLP(FD) solvers for B and B MV action description languages. 
We considered only those instances for which at least one of the solvers gave an answer. A 
score of 1 (0, —1) is assigned to the fastest (second fastest, slowest) solver. The scores of 
all instances of a problem have been summed together, and this provides the radius of the 
circles in the figure. Instances have been separated between "Yes" instances (they admit a 
solution) and 'Wo" instances (they have no solutions). 

The success of the constraint-based approach is evident. However, it is interesting to 
observe that the planning problems that do not make significant use of non-boolean flu- 
ents tend to perform better in the ASP-based implementations — possibly due to the greater 
efficiency of ASP solvers in propagating boolean knowledge during search for a solution. 
Conversely, when numerical quantities are relevant in modeling a planning problem, the 
use of multi-valued fluents and constraints not only reduces the modeling effort, yielding 
more concise formalizations, but also requires a smaller number of fluents (compared with 
the analogous Boolean encoding). This, combined with the use of constraints, often trans- 
lates into a smaller state space to be explored in finding a solution. These seem to be the 
main reasons for the better behavior provided by the B MV approach. 

The distinction between "Yes" and "No" instances is also very relevant. The CLP-based 
solvers tend to perform better on the "Yes" instances, especially for large instances. It is 
interesting to observe that a similar behavior has been observed in recent studies comparing 
performance of ASP and CLP solutions to combinatorial problems dDovier et al. 20051 
IDovier et al. 20071 IDovier et al. 2009al l. 

9 Related work 

The literature on planning and planning domain description languages is extensive, and 
it would be impossible to summarize it all in this context. We focus our discussion and 
comparison to the papers that present languages and techniques similar to ours. 

The language investigated in this work is a variant of the language B originally in- 
troduced in ( Gelfon d and Lifschitz 1998b , as presented in dSon et al. 20011 Sect. 2). Apart 
from minor syntactical differences, any action description V from the language of dSon et al. 20011 1 
can be embedded in our B. The semantics for B presented here reproduces the one of (Gelfon d and Lifschitz 1 998). 

The language ADC has been introduced in (Baral et al. 2002) to model planning prob- 
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Table 4. Experimental results for instances of the 8-tile puzzle problem (timeout 36000 sec). 
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1.23 


0.57 


0.57 


0.10+0.07 


0.14+0.06 


34 


N 


0.17 


3.61 


38.62 


51.22 


3.11 


5.86 


3.11 


0.13+1.75 


0.10+1.60 


35 


Y 


0.18 


1.39 


31.10 


30.25 


3.20 


4.21 


1.39 


0.15+0.54 


0.08+0.32 


36 


N 


0.18 


4.55 


43.97 


57.21 


4.24 


12.68 


4.24 


0.13+1.87 


0.11+1.79 



Table 5. Experimental results for instances of the Wolf-goat-cabbage problem. 

lems in presence of actions with duration and delayed effects. The language relies on multi- 
valued fluents, akin to those used in our language. ADC actions have two types of effects: 

1 . Direct modification of fluent values, described by dynamic causal laws of the forms 

a causes / = g(f, /i ,...,/„, t) from t x to t 2 (24) 
a contributes g(f, /i ,...,/„, t) to / from t x to t 2 (25) 

The first axiom describes the value of the fluent / as a function, that modifies its 
value over the period of time from t\ to ti — these represent time units relative to the 
current point in time. The second axiom is similar, except that it denotes the quantity 
that should be added to the value of / over the period of time. These axioms are 
important when describing actions whose effect has a known duration over time 
(i.e., the interval of length t 2 — t\). 

2. Indirect modifications through the initiation and termination of processes, that can 
modify fluents until explicitly stopped; the axioms involved are axioms for the cre- 



ation and termination of processes: 

ai initiates p from t x (26) 

a 2 terminates p at t 2 (27) 

and axioms that describe how processes modify fluents 

p is.associated_with / = g(f, f u ...,/„, t) (28) 

p is_associated_with / <- g (f, /i ,...,/„, t) (29) 



The first axiom describes how the value of the fluent / will change as a function 
of time once a process is started; the second axiom determines how the value of / 
changes while the process p is active. 
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Three-barrel 



Community 



Goat&C. 



Tile-puzzle 



a. 

< 

E— 
in 
W 
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2Q 



Q 
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u 



EQ 



"No" instances 



o- 

< 

E— 
in 
oj 
CO 

2Q 



O 

o 



Q 

H-l 

cT 

o 



EQ 



"Yes" instances 



< 

H 

W 
CO 

EQ 



Q 

IX. 
CL 

U 

23 



Q 

w 

-J 

U 



All instances 



Table 6. Relative performance of the solvers for each set of instances (the radii of the 
circles are proportional to the performance of the specific solver). 

ADC has some similarities to B MV \ they both allow multi-valued fluents and some forms 
of temporal references. B MV has the flexibility of allowing non-Markovian behavior and 
it allows references to values of fluents at different time points, features that are missing in 
ADC. On the other hand, ADC allows the representation of continuous time and the ability 
to describe continuous changes to the value of fluents. 

Several features of ADC can be reasonably simulated in B MV ; we will focus on the 
axioms of type (|26ii-(|29l), since these subsume the capabilities of axioms d24b and d25l ): 

• we can represent each process p using a corresponding fluent; 

• the axioms (l26l l and (l2Tb can be simulated by 

causes(a 1 ,p* 1_1 = l,true) causes(d2, p* 2-1 = 0, true) 

• the axiom ( |28l l can be simulated by introducing the static causal law 

causecl(p > 0, / = gtf- 1 , /f 1 , . . . , j^ 1 ,^ 1 ) A p = fT 1 + 1) 

Note that, due to the inability of B MV to handle continuous time, we are considering only 
discrete time measures. 



The language C + proposed in ( Giunchiglia et al. 2004a i also has some similarities to the 
language B MV . C + does not offer capabilities for non-Markovian and temporal references, 
but supports multi- valued fluents. The syntax of C + builds on a language of fluent constants 
(each with an associated domain) and action names (viewed as Boolean variables): 
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• Static causal laws 

caused F if G 

where F and G are fluent formulae (i.e., propositional combinations of atoms of 
the form f = v for / fluent and v <E dom(/)). The language introduces syntactic 
restrictions that are effectively equivalent to preventing cyclic dependencies among 
fluents. Static causal laws describe dependencies between fluents within a state of 
the world. 

• Fluent dynamic laws 

caused F if G after H 

where F and G are fluent formulae and H is a formula that may also contain action 
variables. The semantics of dynamic laws can be summarized as follows: if H holds 
in a state, then the implication G — > F should hold in the successive state. 

• Actions that can be freely generated are declared to be exogenous 

exogenous a 

• Fluents can be declared to be inertial (i.e., they satisfy the frame axiom) 

inertial / 

The relationships between the two languages can be summarized as follows: 

• C + is restricted to non-cyclic dependencies among fluents, while B MV lifts this re- 
striction. 

• C + is capable of identifying fluents as inertial or non-inertial, while B v focuses 
only on inertial fluents (though it is relatively simple to introduce an additional type 
of constraint to create non-inertial fluents). 

• C + can describe domains where concurrent actions are allowed — by allowing occur- 
rences of different action variables in the H component of the fluent dynamic laws; 
although B MV does not currently supports this feature, a similar extension has been 
investigated in a recent paper (IDovier et al. 2 009b). 

Subsets of B and C + can be shown to have the same expressive power; in particu- 
lar, let us consider the subset of C + that contains only domains that meet the following 
requirements: 

• there are no concurrent actions — i.e., each H contains exactly one occurrence of an 
action variable; thus 

caused F if G after a AH 

where H is a fluent formula; 

• for each action a, there is a declaration 

exogenous a. 

Under these restrictions, it is possible to map a C + domain D to an equivalent domain in 
B MV . In particular: 

• for each non-inertial fluent /, with default value v, we introduce the static law 

causedCT 1 ^v,f° = v) 
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• for each static causal law caused F if G we introduce a causal law caused(G, F) 

• for each fluent dynamic law r of the form caused F if G after a A H, we introduce 
the following axioms (where execs is a fresh fluent): 

causes(a, execjr = 1, H) 
causes(et, execs 1 = 0,H) 
caused(ea;ec_r = 1 A G, F) 

Logic programming, and more specifically Prolog, has been also used to implement 



the first prototype of GOLOG (as discussed in (Levesque et al. 1997 1). GOLOG is a pro- 
gramming language for describing agents and their capabilities of changing the state of 
the world. The language builds on the foundations of situation calculus. It provides high 
level constructs for the definition of complex actions and for the introduction of control 
knowledge in the agent specification. Prolog is employed to create an interpreter, which 
enables, for example, to answer projection queries (i.e., determine the properties that hold 
in a situation after the execution of a sequence of actions). The goals of GOLOG and the 
use of logic programming in that work are radically different from the focus of our work. 

The work by (Thielscher 2002a) takes a different perspective in using constraint pro- 
gramming to handle problems in reasoning about actions and change. Thielscher's work 
builds on the use of Fluent Calculus (Thielscher 19991 ) for the representation of actions 
and their effects. Fluent calculus views states as sets of fluents, constructed using an op- 
erator o, and with the ability to encode partially specified sets (e.g., f\ o / 2 o Z where Z 
represents the "rest" of the state). In (Thielscher 2002a), an encoding of the fluent calcu- 
lus axioms using Constraint Handling Rules (CHRs) is presented; the encoding uses lists 
to represent states, and it employs CHRs to explicitly implement the operations on lists 
required to operate on states — e.g., truth or falsity of a fluent, validation of disjunctions 
of fluents. The ability to code open lists enables reasoning with incomplete knowledge. 
Experimental results (reported in (Thielscher 2002b)) denote a good performance with re- 
spect to GOLOG. The framework is very suitable for dealing with incomplete knowledge 
and sensing actions. Differently from our framework, it does not support non-Markovian 
reasoning, multi-valued reasoning, and it does not bring the expressiveness of constraint 
programming to the level of the action specification language. The use of constraints in the 
two approaches is radically different — Thielscher's work develops new constraint solvers 
to implement reasoning about states, while we use existing solvers as black boxes. 

A strong piece of work regarding the use of constraint programming in planning is ( Vidal and Geffner 2006 ) 
The authors use constraint programming, based on the CLAIRE language (Cas eau et al. 20021 . 
to encode temporal planning problems and to search for minimal plans. They also use a se- 
ries of interesting heuristics for solving that problem. This line of research is more accurate 
than ours from the implementation point of view — although their heuristic strategies can 
be implemented in our system and it would be interesting to exploit them during the label- 
ing phase. On the other hand, the proposal by Vidal and Geffner only deals with Boolean 
fluents and without explicitly defined static causal laws. 

Similar considerations can be done with respect to the cited proposal by Lopez and 



Bacchus (Lopez and Bacchus 2003 1. The authors start from Graphplan and exploit con- 



straints to encode fc-plan problems. Fluents are in this case only Boolean (not multi-valued) 
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and the process is deterministic once an action is chosen (instead, we deal also with non- 
determinism, e.g., when we have consequences such as / > 5). The proposal of Lopez and 
Bacchus does not address the encoding of static causal laws. 

10 Conclusions and Future Work 

In this paper, we investigated the application of constraint logic programming technology 
to the problem of reasoning about actions and change and planning. In particular, we pre- 
sented a modeling of the action language B using constraints, developed an implementation 
using CLP(FD), and reported on its performance. We also presented the action language 
B MV , which allows the use of multi-valued fluents and the use of constraints as conditions 
and consequences of actions. Once again, the use of constraints is instrumental in making 
these extensions possible. We illustrated the application of both B and B MV to several 
planning problems. Both languages have been implemented using SICStus Prolog. 

We consider the research and the results discussed in this paper as a preliminary step in a 
very promising direction. The experimental results, as well as the elegance of the encodings 
of complex problems, shows the promise of constraint-based technology to address the 
needs of complex planning domains. A number of research directions are currently being 
pursued: 

• we have introduced the use of global constraints to encode different forms of pref- 
erences (e.g., action costs) and control knowledge. Global constraints have been 
widely used in constraint programming to enhance efficiency, by providing more 
effective constraint propagations between sets of variables; we believe a similar use 
of global constraints can be introduced in the context of planning — e.g., the use of 
techniques used to efficiently handle the al ldi f f erent global constraint to enforce 
non-repetition of states in a trajectory. 

• We also believe that significant improvements in efficiency can be achieved by del- 
egating parts of the constraint solving process to an efficient dedicated solver (e.g., 
encoded using a constraint platform such as GECODE, possibly enhanced with local 
search moves). 

• The encoding in CLP(FD) allow us to think of extensions in several directions, such 
as the encoding of qualitative and quantitative preferences (a preliminary study has 
been presented in (ITu et al. 20071 l). and the use of constraints to represent incomplete 
states — e.g., to determine most general conditions for the existence of a plan and to 
support conformant planning ( Son et al. 2007). 

• An interesting line of research is represented by the application of the approach 
discussed here to multi-agent systems. In that case, besides admitting the execu- 
tion of more that one action in each state transition (cf., Remark [TJ, other impor- 
tant issues have to be addressed, since different agents may compete or collaborate 
in order to reach the desired results. For instance, concurrency of actions may be 
subject to constraints to model incompatibilities or interdependences among the oc- 
currences/effects of different actions executed by different agents (even in different 
points in time). Hence, the action description language, as well as its CLP encoding, 
has to be suitably enriched in order to deal with these aspects. A first step in this 
direction has been presented in (IDovier et al. 200 9b). 
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Appendix A Some of the codes of the Experimental Section 

A.l The Three-Barrel Problem: B AIV description of the 12-7-5 barrels problem 

The B MV encoding of the three barrels planning problem for N = 12. (Figure [T]presents 
an encoding using the language £>.) 

barrel (5) . 
barrel (7) . 
barrel (12) . 

f luent (cont (B) , 0, B) :- barrel (B) . 

action (fill (X, Y) ) :- barrel (X) , barrel (Y) , neq(X,Y). 

causes (fill (X, Y) , cont (X) eq 0, [Y-cont (Y) geq cont (X) ] ) :- 

action (fill (X, Y) ) . 
causes (fill (X, Y) , cont (Y) eq cont ( Y) " ( -1 ) +cont (X) * (-1 ) , 
[Y-cont (Y) geq cont (X) ] ) :- 

action (fill (X, Y) ) . 
causes (fill (X, Y) , cont (Y) eq Y, [Y-cont (Y) It cont (X) ] ) :- 

action (fill (X,Y) ) . 
causes (fill (X, Y) , cont (X) eq cont (X) " ( -1 ) -Y+cont ( Y) " (-1 ) , 
[Y-cont (Y) It cont (X) ] ) :- 

action (fill (X, Y) ) . 

executable (fill (X, Y) , [cont(X) gt 0, cont (Y) It Y] ) :- 
action (fill (X, Y) ) . 

caused([], cont(12) eq 12-cont ( 5 } -cont ( 7 } ) . 

initially (cont (12) eq 12). 



goal (cont (12) eq cont (7)) 
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A.2 The HP Protein Folding Problem 

B MV encoding of the HP-protein folding problem with pivot moves on input of the form 
1001001001. . . starting from a vertical straight line. 

length (10) . 

amino (A) : - length (N) , interval (A, 1, N) . 
direction (clock) . 
direction (antick) . 

fluent (x (A) , 1,M) : - 

length (N), M is 2*N, amino (A) . 
fluent (y (A) , 1,M) :- 

length (N), M is 2*N, amino (A) . 
fluent (type (A) ,0,1) :- 

amino (A) . 
fluent ( saw, , 1 ) . 

action (pivot (A, D) ) : - 
length (N) , amino (A) , 
1<A, A<N, direction (D) . 

executable (pivot (A, D) , []) :- action (pivot (A, D) ) . 

causes (pivot (A, clock) , x(B) eg x (A) " ( -1 ) +y (B) " ( -1 ) -y (A) " ( -1 ) , []) :- 

action (pivot (A, clock) ) , amino (B) , B > A. 
causes (pivot (A, clock) , y(B) eg y (A) * ( -1 } +x (A) ' ( -1 } -x (B) " ( -1 } , []) :- 

action (pivot (A, clock) } , amino (B) , B > A. 
causes (pivot (A, antick) , x (B) eq x(A)"(-l)-y(B)'(-l)+y(A)'(-l), []} :- 

action (pivot (A, antick) ) , amino (B) , B > A. 
causes (pivot (A, antick) , y (B) eq y(A)"(-l)-x(A)"(-l)+x(B)*(-l), []) :- 

action (pivot (A, antick) ) , amino (B) , B > A. 

caused([x(A) eg x(B), y (A) eg y (B) ] , saw eg 0) :- 
amino (A) , amino (B) , A < B. 

initially (saw eg 1) . 

initially (x (A) eg N) :- length(N}, amino (A) . 
initially (y (A) eg Y) :- length(N), amino (A) , Y is N+A-l . 
initially (type (X) eg 1 ) : - amino (X) , X mod 3 = : = 1 . 
initially (type (X) eg 0) :- amino (X) , X mod 3 =\= 1. 

goal (saw gt 0) . 

state.cost (FE) :- length (N) , auxc (1, 4,N,FE) . 

auxc (I, J,N, 0) : - I > N-3, ! . 

auxc (I, J,N, FE} :- J > N, !, 11 is 1+1, 

Jl is 11+3, auxc (II, J1,N, FE) . 
auxc (I, J,N,FEl+type (I) *type (J) *rei (abs (x(I) -x (J) ) +abs (y(I) -y (J) ) eg 1)} :- 

Jl is J+2, auxc (I, J1,N, FE1) - 

always (x (1) eg 10} . always (y (1) eg 10} . 
always (x (2) eg 10} . always (y (2) eg 11} . 

cost.constraint (goal geg 4). 
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A3 The Community Problem 

A.3.1 B description of the instance A4 



max.people ( 4 ) . 

person(X) :- max.people (N) , interval (X, 1 , N) . 

money(X) :- max.people (N) , M is N*{N+1), interval (X, 1,M) . 

fluent (owns (B, M) ) :- person (B) , money (M) . 

action (gives (X, Y) ) : - 

person (X) , person(Y), neq(X,Y). 

executable (gives (X, Y) , [ owns (X, Mx) ] ) : - 
action (gives (X, Y) ) , 
fluent (owns (X, Mx) ) , Mx > X. 

causes (gives (X, Y) , owns (X, NewMx) , [owns (X, Mx) ] ) : - 
action (gives (X, Y) ) , money (Mx) , 
fluent (owns (X, NewMx) ) , fluent (owns (X, Mx) ) , 
NewMx is Mx-X. 

causes (gives (X, Y) , owns (Y, NewMy) , [owns (Y, My) ] ) : - 
action (gives (X, Y) ) , money (My) , 
fluent (owns (Y, NewMy) ) , fluent (owns (Y, My) ) , 
NewMy is My+X . 

caused ( [owns (X, Mx) ] , neg (owns (X, Other) ) ) : - 
fluent (owns (X, Mx) ) , fluent (owns (X, Other) ) , 
person (X) , money (Mx) , money (Other) , neq (Mx, Other) . 

initially (owns (X, M) ) : - 
person (X) , M is 2*X. 

goal (owns (X, Mid) ) : - 

person (X), max.people (N) , Mid is (N* (N+l ) ) / /N . 



A.3.2 B description of the instance A4 



max.people ( 4 ) . 

person (X) :- max_people (N) , interval (X, 1, N) . 

fluent (owns (B) , 1,M) :- 

person (B) , max.people (N) , M is N* (N+l ) . 

action (gives (X, Y) } : - 

person(X), personfY) , neq(X,Y). 

executable (gives (X, Y) , [owns (X) gt X]} :- 
action (gives (X, Y) ) . 

causes (gives (X, Y) , owns (X) eq owns (X) * (-1) -X, []) : - 

action (gives {X, Y) ) . 

causes (gives (X, Y) , owns (Y) eq owns ( Y) " ( -1 } +X, []) :- 

action (gives (X, Y) ) . 

initially (owns (X) eq M) :- 
person (X) , M is 2*X. 



goal (owns (X) eq Mid} :- 

person (X), max_people (N) , Mid is (N* (N+l ) ) / /N . 



Multi-valued Action Languages in CLP(FD) 

A.4 The 8-Tile Puzzle Problem 

A.4.1 B description of the instance h 



cell(X) : 
val(X) :- 
near (1,2) 
near (2, 1) 
near (3,2) 
near (4, 1) 
near (5,2) 
near (6,3) 
near (7,4) 
near (8, 5) 
near (9, 6) 



- interval (X, 1, 9) . 
interval (X, 1, 9) , neq(X,3). 
near (1, 4) 



near (2, 3} 
near (3,6} 
near (4,5} 
near (5, 4) 
near (6,5} 
near (7, 8) 
near (8, 7) 
near (9, 8) 



near (2,5) . 

near (4, 7) . 
near (5,6) . 
near (6, 9) . 

near (8, 9) . 



near (5,8) . 



fluent (at (X, Y) ) 
fluent (free (Y) ) 



val(X), cell(Y) 
cell (Y) . 



action (move (X, Y) ) 



val (X) , cell (Y) . 



executable (move (X, Y) , [at (X, Z), f ree (Y) ] ) 
val(X), cell(Y), cell(Z), near(Z,Y). 



causes (move (X, Y) , 
val(X), cell(Y) 

causes (move (X, Y) , 
val(X), cell(Y) 

caused! [at (X,Y) ] , 
val(X), cell(Y) 

caused! [at (X,Y) ] , 
val(X), cell(Y) 

caused! [at (X,Y) ] , 
val (X) , val (W) , 

initially (at (1,1)) 
initially (at (5,2)) 
initially (at (8,6)) 
initially (neg (at (1 
initially (neg (at (2 
initially (neg (at (4 
initially (neg (at (5 
initially (neg (at (6 
initially (neg (at (7 
initially (neg (at (8 
initially (neg (at (9 
initially (neg (free 



at(X,Y), []) :- 

free(Z), [at (X, Z) ] ) 
. cell (Z) . 
neg (free (Y) ) ) :- 



neg(at (X,Z) ) ) :- 
. cell(Z), neq(Y.Z) 
neg (at (W, Y) ) ) :- 
cell (Y) , neq (X,W) . 



initially (at (2, 
initially (at (6, 
initially (at (9, 



3) ) . 
9) ) . 
7) ) . 



initially (at (4,8)). 
initially (at (7,4)). 
initially (free (5) ) . 



,X) ) 
,X) ) 
,X) ) 
,X) ) 
,X) ) 
,X) ) 
,X) ) 
,X) ) 
(X) ) 



cell (X) , 
cell (X) , 
cell (X) , 
cell (X) , 
cell (X) , 
cell (X) , 
cell (X) , 
cell (X) , 
cell (X) , 



neq(X, 1) 
neq(X, 3) 
neq(X, 8) 
neq(X,2) 
neq(X, 9) 
neq(X, 4) 
neq (X, 6) 
neq(X, 7) 
neq(X, 5) 



goal (at (X,X) ) :- val (X) 
goal (free (3) ) . 
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A.4.2 B description of the instance I\ 



cell(X) :- interval (X, 1, 9) . 

tile(X) :- interval (X, 1, 9) , neq(X,3). 

near (1,2) . near (1,4) . 

. . . las for B . . . 
near (9, 6) . near (9,8) . 
fluent (at (X) , 1, 9) :- tile (X) . 
fluent (free, 1,9). 

action (move (X, Y) ) :- cell (Y) , tile (X) . 

executable (move (X, Y) , [at(X) eq z, free eq Y] ) :- 

tile(X), cell(Y), near(Z,Y). 
causes (move (X, Y) , at (X) eq Y, []) :- 

tile (X) , cell (Y) . 
causes (move (X,Y) , free eq at(X)~(-l), []) :- 

tile (X) , cell (Y) . 

initially (at (1) eq 1). initially (at (2) eq 3). 

initially (at (4) eq 8). initially (at (5) eq 2). 

initially (at (6) eq 9). initially (at ( 7 ) eq 4). 

initially (at (8) eq 6). initially (at ( 9 ) eq 1). 

initially ( free eq 5) . 

goal (at (X) eq X) :-tile(X). 

goal (free eq 3) . 



Multi-valued Action Languages in CLP(FD) 



A.5 The Wolf-Goat-Cabbage Problem 

A. 5.1 B description of the Wolf- goat-cabbage problem 



obj (goat) . 
obj (cabbage) . 
obj (wolf) . 
obj (man) . 

side (left) . side (right) . 
pos (X) :- side (X) . 
pos (boat) . 



fluent (is.in (X, Y) ) :- 
fluent (boat.at (Y) ) :- 
fluent (alive) . 

action (sail (A, B) ) : - 
action (go.aboard (A) ) 
action (get .off (A) ) : - 

executable (sail (A, B) 
side (A) , side (B) , 



obj (X) , pos (Y) 
side (Y) . 



side (A) , 
:- obj (A) 
Obj (A) . 



side (B) , neq (A, B) . 



[boat.at (A) , 
neq (A, B) 



is.in (man, boat) ] ) 



executable (go_aboard (A) , [boat.at (L) , is.in (A, L) ] ) 

obj (A) , side (L) . 
executable (get .off (A) , [is.in (A, boat } ] ) : - 

obj (A). 

causes (sail (A, B) , boat_at (B) , []) :- 

side(A), side(B), neq(A,B). 
causes (go.aboard (A) , is.in (A, boat) , []) :- 

obj (A) . 

causes (get .off (A) , is.in (A, L) , [boat.at (L) ] } 
obj (A) , side (L) . 



caused ( [ is.in (Ogg, LI ) 

obj (Ogg) , pos (Ll) , 
caused ( [boat_at (Ll) ] , 

side (Ll) , side (L2) 
caused ( [ is.in (A, boat ) 

obj (A) , obj(B), 
caused ( [is.in (wolf 

pos (L) . 
caused ( [is.in (cabbage, L) , 

pos (L) . 

initially (is.in (A, left) ) 
initially (alive) . 
initially (boat_at (left) } . 



, neg (is.in (Ogg, L2 )} ) :- 
pos (L2) , neq(Ll,L2) . 
neg (boat_at (L2) ) ) : - 
neq (Ll, L2) . 

is.in (B, boat) ] , neg (alive) ) 
diff (A,B,man) . 
L) , is.in (goat, L) 



neg (is.in (man, L) } ] , neg (alive) ) 



is.in (goat, L) , neg (is.in (man, L) } ] , neg (alive) ) 



obj (A) . 



goal (is.in (A, right) ) :- obj (A) 
goal (alive) . 
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A.5.2 B description of the Wolf- goat-cabbage problem 

obj (goat) . 
obj (cabbage) . 
obj (wolf) . 
obj (man) . 

% 0=boat, l=on-the-left, 2=on-the-right : 

fluent (is_in(X), 0,2) :- obj (X) . 
fluent (boat.at, 1, 2) . 
fluent (alive, 0,1). 

action (sail) . 

action (go.aboard (A) ) :- obj (A) . 
action (get.off (A) ) :- obj (A} . 

executable {sail, [ is_in (man) eg 0]). 
executable (go.aboard (A) , [boat.at eg is_in(A)]} :- 
obj (A) . 

executable (get.off (A) , [is_in(A) eq 0]) :- 
obj (A) . 

causes {sail, boat.at eg 1, [boat.at eg 2]). 
causes {sail, boat_at eg 2, [boat.at eg 1] ) . 
causes (go_aboard (A) , is_in (A) eq 0, [ ] ) : - 
obj (A) . 

causes (get.off (A) , is_in (A) eq boat.at * (-1 ) , []) :- 
obj (A) . 

caused ([ is_in (A) eq 0, is_in(B) eq 0], alive eg 0} :- 

obj (A) , obj(B), diff (A,B,man) . 
caused { [is_in (wolf) eg is_in (goat) , 

is_in (man) neq is_in (wolf) ] , alive eq 0} . 
caused ( [is_in (cabbage) eq is_in (goat ) , 

is_in(man) neq is_in (cabbage) ] , alive eg 0). 

initially ( is_in (A) eq 1} :- obj (A) . 
initially (boat_at eq 1). 
initially (alive eq 1) . 

goal (is_in (A) eg 2) :- obj (A) . 
goal (alive eg 1) . 
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A.6 The Gas-diffusion Problem: B MV description of the instance A 4 



room(N) :- interval (N, 1 , 11 ) . 

gate (1,2) . 

gate (1,7) . 

gate (1, 11) . 

gate (2,3) . 

gate (3,4) . 

gate (4,5) . 

gate (5,6) . 

gate (6,7) . 

gate (6,8) . 

gate (8,9) . 

gate (9,10) . 

gate (10, 11) . 

fluent (contains (N) , 0, 255) :- room(N). 
fluent (is.open (X,Y) , 0, 1) :- gate(X,Y). 

action(open(X,Y) ) :- gate(X,Y). 
action (close (X, Y) ) :-gate(X,Y). 

executable (open (X, Y) , L) :- 
action (open (X, Y) ) , 

findall ( (is.open (X, Z) eq 0), gate (X, Z ) , LI ) , 
findall ( (is.open (Z, X) eq 0), gate ( Z, X) , L2 ) , 
findall ( (is.open (Y, Z) eq 0), (gate (Y, Z) , neq (Z, X) ) , L3) , 
findall ( (is.open (Z, Y) eq 0), (gate ( Z , Y) , neq ( Z , X) ), L4 ) , 
append (LI , L2 , La ) , append ( L3 , L4 , Lb) , append (La, Lb, L) . 
executable (close (X, Y) , [ is.open (X, Y) eq 1]} :- 
action(close(X,Y) ) . 

causes (open (X, Y) , 

contains (Y) eq (contains (X) " (-1) tcontains (Y) * (-1) ) /2, 
[]) :- 
action (open (X, Y) ) . 
causes (open (X, Y) , 

contains (X) eq ( contains (X) " (-1 ) tcontains (Y) "( -1 )) 12 , 
[]) :- 
action(open(X,Y) ) . 
causes (open (X, Y) , is.open (X,Y) eq 1, []} :- 

action(open(X,Y) ) . 
causes (close (X, Y) , is_open(X,Y) eq 0, []} :- 
action(close(X,Y) ) . 

initially (is.open (X, Y) eq 0) :- gate(X,Y). 
initially (contains (10) eq 128). 
initially (contains (3) eq 128). 

initially (contains (A) eq 0) :- room(A), dif f (A, 3, 10) . 
goal (contains (1) gt 50). 



